Gene TM1040_1227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1227 
Symbol 
ID4075935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1320632 
End bp1321807 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content61% 
IMG OID638006535 
ProductFmu (Sun) 
Protein accessionYP_613222 
Protein GI99081068 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCGG GTGCGCGCGT ACAGGCGGCT ATTGAAATTC TCGATGAAAT CCTCGCGGGT 
CAGGCTGTGG AAAAGACCCT GACCAACTGG GCGCGGCGCA GTCGATTTGC CGGATCAAAA
GACCGCGCCG CGGTGCGCGA TCACGTCTAT CAGGCGCTTC GGTGCCGCCG GTCCTATGCG
GTTCTGGGGG GAAGTGAGAC CGGACGGGGG CTGATGCTTG GCGCTTGCAA GGATCAAGGG
CTTGATCAGG CAGTGCTCTT TCACGGTGAA GGGCATGCGC CGTCCCCCCT CAGCGCCGCC
GAACAGAGCA TCGCGCCCGA ATTTCACAGC GACGCAGAGC GTCATGATAT CCCGGAGTGG
CTCTGGCCCG TGTTTTCACG CAGCCTCGGC ACTGAGGCGA TTGCAGCGGC AACCGCGCTC
AGGTCTCGAG CGGCAGTTCA TCTGCGGGTG AACCTCTTGA AAGGAGATCG CGACAACGCC
ATCAAGCGGC TCACGCGGGA GGGCATTGCC ACAGAGCCGC ATCCAGCCTC GCCGACCGCC
CTGACCGTGA CAGAGGGGGC GCGCCGCATA AAAAACGCAG AAAGCTACCT GCAAGGCTTT
GTGGAATTGC AGGACGCTGC CAGTCAGGCA GTCGTCGATA AGCTGCCAGT GCAAAATGCC
CCGAGGATAT TGGACTATTG TTCCGGCGGT GGAGGCAAGG CTCTGGCGAT TGCAGCACAG
ACCCAGGCTG AGGTCTATGC GCATGATGCA GACCCACGAC GCATGCGCGA CATTCCCGAA
CGCGCCATGC GGGCGGGGGC GGATATTCGC TGCCTTACCT CCGAGGAGCT TGTAACGCAG
GCGCCGTTTG ACCTCGTGCT CTGTGATGCG CCCTGCAGTG GTAGCGGGTC TTGGAGGCGT
GATCCCGAGG GTAAGTGGCG CCTCACGCAG GACACTCTTG ATGACACCGT AGCGCTGCAG
GCCCGAATTC TGGATGAAGC TGCTCAACGC GTCGCGCCGG GGGGCGTCCT GGCCTTTGCG
ACCTGTTCGA TGCTGGATGT GGAAAACAGC CTGCAGACAC AGCGCTTTCA GGAGCGGCAC
ACCGGCTGGG CGCACTTGTC TGAAACGGCA TGGCATGTGC ATAGTGGAAC AGACGGATTT
TACGTATCGG TGTTTCGGCG GAATGGCACA GAATAA
 
Protein sequence
MTPGARVQAA IEILDEILAG QAVEKTLTNW ARRSRFAGSK DRAAVRDHVY QALRCRRSYA 
VLGGSETGRG LMLGACKDQG LDQAVLFHGE GHAPSPLSAA EQSIAPEFHS DAERHDIPEW
LWPVFSRSLG TEAIAAATAL RSRAAVHLRV NLLKGDRDNA IKRLTREGIA TEPHPASPTA
LTVTEGARRI KNAESYLQGF VELQDAASQA VVDKLPVQNA PRILDYCSGG GGKALAIAAQ
TQAEVYAHDA DPRRMRDIPE RAMRAGADIR CLTSEELVTQ APFDLVLCDA PCSGSGSWRR
DPEGKWRLTQ DTLDDTVALQ ARILDEAAQR VAPGGVLAFA TCSMLDVENS LQTQRFQERH
TGWAHLSETA WHVHSGTDGF YVSVFRRNGT E