Gene TM1040_0476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0476 
Symbol 
ID4078514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp495160 
End bp496929 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content62% 
IMG OID638005772 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_612471 
Protein GI99080317 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCCG CATTTTACGG ACTATCCACT CGGATCTACG CCCTTGTAGC GCTTGCGCTC 
GTCGCGGCCG CAAGCTTGAC TTTCTACACC CTGAACCATG CCTCTCAGAC CGCCTATGAT
CTTCGGGCGG GAGAGTTGCG CCACATAACA GACATCGCCC GCAGCCATCT GGACGCCCAT
AATGCCCGCG TCGAAGCCGG TGAAATGACC CAGGCCGAGG CGCAAGCCTC GGTGGCTCAA
ATTCTCAATG ATCTGCGCTT TGGCGAAAAC GGCTATGTCT ATGCCTTTGA CAGCGACATC
GTCTATTCCG TCCACCCCTT CCGCCCGCAA TGGGTTGGAG GTGAGAGCCA GCGCGACCTG
AAGGACGTGC ATGGCAACCG GATCTTCGAA AACATCCTCG GCAATCTGGA TGCTGAGGGG
CACTCGCTCT CGACGATCTA TTTCGAGAAC CCCGCGACCA AAGCCGTCGA ACCCAAGCTC
ACATACGCAC AATATTATGA ACCCTGGGAC TGGTACCTTG GGACCGGCGC CTACATGAAC
GACATTGAGG CCGACATCGC GCAAATGCGG ATGCAGGCCC TGATGGGGCT TGGTGTTGCC
CTCGCCGTGT TGATCGGCGT ATCGCTCTTG ATTATCCGCA GCATGCTGCC GCCTCTTGAT
GCGCTCAAGG CACGTATGGC CACCATGGCC GATGGCGACC TGGACGCGGA TGTGCCGGGC
CTGTCGGATC GCAGCGAGAT CGGCCAAATG GCGCATGCAG TGGCGAATTT CCGGGATGGG
CTGCAAATGC AGGCAGCACT GGAAGCAGAG GCGCACATCA AGGATGAGGA ACGCCAGCGC
GTCGTCGCCA TCCTAAGCCA GCGTCTCGCG AAATTTGCCC AGGGGGATCT CACCGTGCGC
GTCGATGACG CCATGCCCGA CGAGTTCCGG CAGGTGGCAG AAGACTTTAA CGCAACGGTT
ACGCAGATCT CCGACCTGAT GCGCACGCTG GTCGCCGGGG TGGAGCGGAT TGAGAGCGAA
AGCGAATCCC TCGACAGCTC CTCGCGTGAG CTTGGCATGC GTACGGAAAC CCAGGCAGCC
TCTCTCGAAG AAACGTCCGC GGCCCTCACA GAGCTCTCGG CTTCGGTGAA AAACTCTGCC
GAAGAGAGCC GCGCAGCCTC CGAGCGGGTG CAGCAGGCCA GCGAACGGAC CGAGCGCGGC
GCGCAGGTGG TCCGGCGCAC CATTGAGGCC ATGCAGGGGA TCGAGGAAAG CTCCGACAAG
ATCTCGAGCA TCACCTCTTT GATCGACGAT ATCGCCTTTC AGACCAGCCT GCTGGCGCTC
AACGCCGGGG TTGAGGCCGC ACGCGCGGGC GAAGCCGGCC GCGGGTTTGC GGTTGTGGCT
GGCGAGGTGC GCGCTTTGGC CGGGAAATCC TCAGATGCGG CGCGCGAGAT CGCAGAGTTG
ATCACAAACG CCAGCCGCGA GGTGGAGACC GGTGTGTCAC TGGCGCGCGA CTCTGGCACG
GCTCTTGACG AGATCAACGC GCATATCTCC GCCATCAACG AAACCGTCAG CGCCCTTGCT
GAGGCGGCCC GGGAACAAGC CACAAGCCTC TCGGAGATCA CCTCCGCCGC CGACCAGCTT
GATCAGGTCA CACAGCAGAA CGCGGCAATG TTCGAGGAAA CCTCTGCTGC AACCCAGCGC
CTGCGGGATG AGGCAAACGC CCTGGCTCAG AACGCCCGTC AGTTTCAGGT TGATGACGCG
GATCCCGCTG CGCAACGCCG GGCCTCATAA
 
Protein sequence
MPAAFYGLST RIYALVALAL VAAASLTFYT LNHASQTAYD LRAGELRHIT DIARSHLDAH 
NARVEAGEMT QAEAQASVAQ ILNDLRFGEN GYVYAFDSDI VYSVHPFRPQ WVGGESQRDL
KDVHGNRIFE NILGNLDAEG HSLSTIYFEN PATKAVEPKL TYAQYYEPWD WYLGTGAYMN
DIEADIAQMR MQALMGLGVA LAVLIGVSLL IIRSMLPPLD ALKARMATMA DGDLDADVPG
LSDRSEIGQM AHAVANFRDG LQMQAALEAE AHIKDEERQR VVAILSQRLA KFAQGDLTVR
VDDAMPDEFR QVAEDFNATV TQISDLMRTL VAGVERIESE SESLDSSSRE LGMRTETQAA
SLEETSAALT ELSASVKNSA EESRAASERV QQASERTERG AQVVRRTIEA MQGIEESSDK
ISSITSLIDD IAFQTSLLAL NAGVEAARAG EAGRGFAVVA GEVRALAGKS SDAAREIAEL
ITNASREVET GVSLARDSGT ALDEINAHIS AINETVSALA EAAREQATSL SEITSAADQL
DQVTQQNAAM FEETSAATQR LRDEANALAQ NARQFQVDDA DPAAQRRAS