Gene TM1040_0404 details


Gene Information

Locus tag: TM1040_0404
Symbol:
ID: 4078798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 413522
End bp: 414805
Gene length: 1284 bp
Protein length: 427 aa
Translation table: 11
GC content: 63%
IMG OID: 638005699
Product: cytochrome c, class I
Protein accession: YP_612399
Protein GI: 99080245
COG category: [C] Energy production and conversion
              [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
        [COG3474] Cytochrome c2
TIGRFAM ID:
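
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes 1-based inclusive coordinates and an unspliced CDS that ends in a stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Start bp and End bp as listed for TM1040_0404
print(cds_lengths(413522, 414805))  # (1284, 427)
```

Under these assumptions the result agrees with the listed gene length of 1284 bp and protein length of 427 aa.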


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.06087
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCGGC GACTGGCTCT TGGCCTGCTC CTCTGCGCCG CCTCTGGGGT GAGCGCGGAG 
GAGTTCCAGA CCCTCAAAGG ACACGGAGGG CCGATCATGG CGCTTGCCGT GAACGATCGG
GGGCACGTGG CCAGCGCCAG TTTCGACAAT TCCGTCTCAC TCTGGCAGGA GGGGGCGCCG
AGCTGGCTCG AGGCACATGA AGCAGCCGCG ACCGTGGTGG CCTTTGGCCC TGATGATACC
CTGTTCAGCG CAGGCGACGA CTTTGTGATT TATCGCTGGC AGCAGGGCCA CCCCCAAGAG
ATCGGACGGC ATACCGCCAA GATCCGCGCC TTGGACCTCT CGCGCGATGG GGAATGGCTC
GCCTCCGCAA GCTGGGATGG CGGCATTGGT CTTTGGCCTA TGAGTGCGGG CACGCCCCGC
CGCATCGCGG TTGGAACGGG CGTGAACGAT CTTGCCTTTG ACGGGGCCGG TCGCCTCTTC
GTCGCCACCA TGACCGGGCA GATCCAAGTC TTCGACAGCC CGGAGGCTGC CCCACGAATT
CTGGCGGAAC AAGGGTTTGG CATCAACCGT TTGGTGCTCT CTGCTGCTGG CTGGCTGGCC
TATGGCGCCG TTGATGGCGG CACCCGTGTG ATCAACGCAG AAACCGGGGC CGAGATTGCC
GATTTCACCC TCGACCGGCG GCCCATACTG GCCCTTGCGC ATCATGCCGA GAGCCAGCAG
ATCGCCGTGG GTGATGGGCA TGGCTATATC ATGATGATCG ACACACACGA CTGGAGCATT
GCGCGCGATT TTCGGGCCAT GCGCGAAGGC CCCGTCTGGG CTCTGGCGTT TTCAAAGGAC
GGCCAGCGGG TCTGGGCAGG CGGCATACAC GATGTGATCT ATGGCTGGCC CATCGCGCTG
ATGGCCAGCA GCCCAGCGGC GGGAACCGAG ACCCGCACAT TCCTGCAGGC GCCTGAAACC
ATGCCCAATG GTGAGCGCCA ATTCATGCGA AAATGCTCGG TTTGCCACGA TTTGGTCGCC
ACAGAGCAGC GGCGCGCCGG TCCTCATCTG GCGGGGCTCT TTGGACGACC GGCCGGCAGT
CTGCCTGGCT ATCGGTATTC CGACACGCTG GCGCAGTCTG ACATCATCTG GGGTGCCGAG
ACCATCGATG CCCTGTTCGA TCTCGGCCCT GACCATTATA TTCCGGGGTC CAAGATGCCG
ATGCAGCGCA TCACCGCCCC CACAGATCGC CAAGATCTGA TAGACTATCT GAAAACCGCG
ACACAACTTT CGGAGGATAA TTGA
 
Protein sequence
MLRRLALGLL LCAASGVSAE EFQTLKGHGG PIMALAVNDR GHVASASFDN SVSLWQEGAP 
SWLEAHEAAA TVVAFGPDDT LFSAGDDFVI YRWQQGHPQE IGRHTAKIRA LDLSRDGEWL
ASASWDGGIG LWPMSAGTPR RIAVGTGVND LAFDGAGRLF VATMTGQIQV FDSPEAAPRI
LAEQGFGINR LVLSAAGWLA YGAVDGGTRV INAETGAEIA DFTLDRRPIL ALAHHAESQQ
IAVGDGHGYI MMIDTHDWSI ARDFRAMREG PVWALAFSKD GQRVWAGGIH DVIYGWPIAL
MASSPAAGTE TRTFLQAPET MPNGERQFMR KCSVCHDLVA TEQRRAGPHL AGLFGRPAGS
LPGYRYSDTL AQSDIIWGAE TIDALFDLGP DHYIPGSKMP MQRITAPTDR QDLIDYLKTA
TQLSEDN
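
The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any computation. The sketch below is one possible way to strip the layout, compute a GC fraction, and translate codons. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one; translation table 11 uses the same amino acid assignments and differs only in which start codons it allows, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same assignments for every codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw):
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(raw):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw):
    """Translate codon by codon, stopping at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied as displayed
first_gene_line = (
    "ATGCTGCGGC GACTGGCTCT TGGCCTGCTC CTCTGCGCCG CCTCTGGGGT GAGCGCGGAG"
)
print(translate(first_gene_line))  # MLRRLALGLLLCAASGVSAE
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the listed GC content.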