Gene TM1040_3124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3124 
Symbol 
ID4074995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp98105 
End bp99355 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content49% 
IMG OID638004626 
Producthypothetical protein 
Protein accessionYP_611360 
Protein GI99078102 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.610314 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCAG AGGTCGCAAT TATTAATAGG TCGGGCATCG CTCTGGCCGC CGACAGCGCC 
GTAACGATCG GCCGCGACCG GGTATGGAAG AACTCCAACA AACTTTTTCA TCTAGCACCA
TCAAACGACG TTGCAGTCAT GGTCTTCGGA AGCGGGGACT ACTGCGGCTT GCCTTGGGAA
GTTGTGATTA AGGAATTTAG AAAAAGCCTC GGGAAAACAA CATACTCCAG GGTAGAAGAG
TATGTTGGGC GCTTTCTGGG TTTTTTGGAC GACTTGGTTG TTCCTCCAAC ACCGCTGGCC
GATTTGACCG GATGGTACAT AATCCTTAAC GCCATCAGCC AGACCCAAAA AGCTATGACT
GCCAGTGGTT CGCTAAAACG TCGCCAACAG CTTATCGCTG CAATCTCCGA GAAAATCGAA
GAAGCGGATC ACTATCCCCT CCTCTTCGAC GGATACTCTC GTGATCAGTA CCGCAAGAAA
CACTCCCAAA AGATCAAAGA GTTTATGGCG GAAGAGCTTG GAATGCATGT CACCCAGACC
ATGCACTCCA AGATGATAAC GCTTTGCTAC GAACGGTCGC GGAGAGCTTT CGAAACGAAG
TTCGAGACGG GGGTTGTCTT CGCCGGATAT GGCGACTCGG AACTCCTGCC TGTCGTTATC
GAAATGTGTG TCGATGGCGA ATTAGAAGGG AAGGTCAGAG CCTGGCAGGT TCGTGAAAAC
AACATGAATG AAGGCGGAAC TTCTGGCGCC ATCCTCCCTT TCGCACAAGC CGATGTAGCC
AATCTTTTTG TGGAAGGTAC GCTACCACAA TATCTGAGTT ATACTCGACA GACCCTTCTG
CAGACCCTCG ACCTGAAAAC TGCAGAACTT GTTAAAGACT ATGTCCCAGA ACCAGATCGC
GTTGTCGAAA TGGAACGGCA AAAGAAAGCC AACCGCGCTA TGGTTAAGCA GTTTTCAACC
GACTTCAAAC AATATCGGCA CGACGAATCT GTCGCCAACC TTCTAAAAGT GGTAAACTCT
CTACCCAAAG AGGAAATGGC GGCTATGGCT GAGGCTCTTG TGGAGATTAC CTCTTTGCGG
AGAAAGATGG ATTCATCACT TGAGACTGTA GGTGGCCCTG TCGACGTTGC GATTATCTCA
AAGTCGGACG GGTTTGTCTG GACAAAGCGA AAGCACTACT TTGATGTTGA ATTCAACAGA
GATTTCATGG AAAGGCGCAA CCAAAGGTAT CAGGGGAACC AAGATGCGTA G
 
Protein sequence
MTAEVAIINR SGIALAADSA VTIGRDRVWK NSNKLFHLAP SNDVAVMVFG SGDYCGLPWE 
VVIKEFRKSL GKTTYSRVEE YVGRFLGFLD DLVVPPTPLA DLTGWYIILN AISQTQKAMT
ASGSLKRRQQ LIAAISEKIE EADHYPLLFD GYSRDQYRKK HSQKIKEFMA EELGMHVTQT
MHSKMITLCY ERSRRAFETK FETGVVFAGY GDSELLPVVI EMCVDGELEG KVRAWQVREN
NMNEGGTSGA ILPFAQADVA NLFVEGTLPQ YLSYTRQTLL QTLDLKTAEL VKDYVPEPDR
VVEMERQKKA NRAMVKQFST DFKQYRHDES VANLLKVVNS LPKEEMAAMA EALVEITSLR
RKMDSSLETV GGPVDVAIIS KSDGFVWTKR KHYFDVEFNR DFMERRNQRY QGNQDA