Gene TM1040_3087 details

Gene Information

Locus tag: TM1040_3087
Symbol:
ID: 4075533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 54543
End bp: 55976
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 60%
IMG OID: 638004588
Product: hypothetical protein
Protein accession: YP_611323
Protein GI: 99078065
COG category:
COG ID:
TIGRFAM ID:
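
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (neither is stated in the record). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(54543, 55976)
print(length, protein_length(length))  # 1434 477
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.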


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGACA TCGACATTGT CAGATCCGAC AACGATCCGC GCCAGAGCGA TCACGGCATC 
CGCACCGATA CCGCGTCGCG CGGGGTGGAC CTCGAAGGGG CGAAGATCAC CGTCACCTAT
ACCGACGGAA CCACCGAGAC GCTCACCTGG AAAGCGCTTG ATCCCTATAC TGCTGGGGGC
GCGACCGGTG ACAATATCGA CATGTACTTC GGCTATGACT GGCACCAGCT GACCACGACA
AAGCCCCTCG CCTCGCTCAA GATCGATCTG GCCCCGGCAA ACTCCGTGTT CGACACAACC
TTTGCCAGTG ATGGCAACCC GAACGATCCC TCCACTCCAG GCTCCAAAGA AGGCTTTCCC
TTCAAGGTCT CGCCCGACTA CGAGGACCTT GGCGGCACGA TCACCGCGAC CTACTCCGGG
ATCGCGGGGC TCGATGGCAA TGCGCCCGTA GGGGATCTCT ACACCACGAT GACGCTGGAT
TTCACCAGCC TGCCCGGCGG TGGCTTGCAG GGGGATCTGG TCTGGAACTC CGATATCGAC
ACGCTTACCG GTCCCTTTGC CTCTTCTGTG ACCGCGACGG ACGACTTTGT TTCCATCTCC
GGGAATGGCT CGGAAGTGAT TGATATTCTC GGCAATGACG CCGGGGCGGG CAAGGGTCCT
CTGACCATTT CCCATATCGC GGGCACCGCA ATATCGGCCG GAGAGTCGGT GACGCTCGCA
AGCGGTGAGG TGATTACCCT CAATCCCGAC GGAACCTTGT CGATCACCAA CGACAGCCCC
GAGGATGAAA CCAACAGCTT CACCTACACA GTGGTGGATG AGGCAGGAAA TACCGCGACC
GGCACCGTCA CGGTTGACAC CAAACTCACA GCGCCCCCCT GCTTTGTCGC GGGCACGTTG
ATCGACACCG AAGCGGGGCC GATCGCGGTC GAAGATCTCA CCGTCGGCAT GCAGGTGATG
ACGCGGGACC ATGGGCCACA ACCACTGCGC TGGATCGGTC GCAGCACCCG ACTCGCGCGG
GGCCGCAACG CACCAGTGCT GATCGCCCGC AACGCTTTGG GCCACCATGG CCAGATCGCA
CTTTCGCCCA ATCACCGCGT CCTGATCCGA TCAGAGCTCG CAGATCTGCA ATTTGGCCAG
CCCGAGGTGC TCATCAAGGC GAAGCATCTG GTCAATGGAG ACAGCATTCG GATTCTCAGC
GATGGCACGC AGGTGTGCTA TGTGCACATC CTCTTTGACC GCCACGAGAT CGTGCGTGCC
AATGGTCTGG ACAGCGAGAG CTACCACCCC GGCGCTGAAA CCCTTGGTGC TTTTGACGCG
GACACCCGAG CCGAGATTCT CGACCTCATG TCAGACTGGC AAGAGTATGG ACCTGCTGCA
CGGATCACGC TCAAGCCGCG CGAATGTGCG CTGCTTCAAC TAAACCGACA CTAA
 
Protein sequence
MPDIDIVRSD NDPRQSDHGI RTDTASRGVD LEGAKITVTY TDGTTETLTW KALDPYTAGG 
ATGDNIDMYF GYDWHQLTTT KPLASLKIDL APANSVFDTT FASDGNPNDP STPGSKEGFP
FKVSPDYEDL GGTITATYSG IAGLDGNAPV GDLYTTMTLD FTSLPGGGLQ GDLVWNSDID
TLTGPFASSV TATDDFVSIS GNGSEVIDIL GNDAGAGKGP LTISHIAGTA ISAGESVTLA
SGEVITLNPD GTLSITNDSP EDETNSFTYT VVDEAGNTAT GTVTVDTKLT APPCFVAGTL
IDTEAGPIAV EDLTVGMQVM TRDHGPQPLR WIGRSTRLAR GRNAPVLIAR NALGHHGQIA
LSPNHRVLIR SELADLQFGQ PEVLIKAKHL VNGDSIRILS DGTQVCYVHI LFDRHEIVRA
NGLDSESYHP GAETLGAFDA DTRAEILDLM SDWQEYGPAA RITLKPRECA LLQLNRH
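
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCCAGACA TCGACATTGT CAGATCCGAC"))  # MPDIDIVRSD
```

Passing the full gene sequence to `translate` in the same way allows the result to be compared against the protein sequence listed in the record.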