Gene TM1040_2977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2977 
Symbol 
ID4078007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3140954 
End bp3142540 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content59% 
IMG OID638008306 
Producthypothetical protein 
Protein accessionYP_614971 
Protein GI99082817 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.618788 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.601825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGATA AAATCCTTAC GACACAGTCT GTTGGCAGTA CGTCAACTGA GTCGTCCAAG 
CGAACGGAGC CCGCGCGCCG AGGCGGTGAT TCATTCGAGT CCGTTCTTGC CCGCGAGGCA
AAAACAAAAG ACAAAACCAA GACCTCTCCG GATGTCGAGA CACCTTCTGG CGCAGAGCAT
GACGACAAGG CTTTTGACGC TAAGAATAAC GCGAAGGCCA CCGCAGCTGA CCAGCATGCG
AAGGGTGAGG AATCGCATCC AGAGCAGGAC GCGGGTGAAA CGTCCCTGGT TGCTCCGGAC
GAAAACCCTG AGAGCGAATG GGCAAGTGTC ATCGACACCG CAACACAGCC CGACGTCGAG
GAAGCCCTGG ACGAGGCGCT CCCCGAGCTT GATACATCCG CAGAGGATTT TGCGCAGGCG
GATGTGGCAG AGGGTCAGGC AGTTGCTGCT GCAGGCGCCA CACAGGTGTC ATCTGATCAA
TCTGCAAAGG GCTCCGGCGA TCAGGTGATG CGCGCGGGCG CTCCTGTGAC GCATGGCTCT
GCCACTGACA AAGCGGTGGA GGTGGCCATA AAAGGTGCTG CCTCGGCTGA GGCAGATGCC
GAAATGAATT CAATGGAGGC GGCGCGTACG GGGCTCTTGT CAGATAAAGA CGAACGTCTC
GGCACATCAG TTCTCTTGGA AACACATCGA GATGGGCGAC GCCAACAGCT TGTGCCCGGA
ACAGTCGGCG CGATTGCTGC CGAAAAAGGT GCGCAGCCGC CTGCGCAGGC ACAGGCTCAG
ACCGCTCTGG AGGCGCTGAA AACAGGGGCG CCCACATCTC CTCCGGGACA GGAGAAAGCC
ACTGAAGCGC GCATCCGTGA TATACCATTG ACCGCAGCTC AGGCACAGGT CGCGGCTTCC
ACAGCGACTG CGCGCGCGCC AATGAACAGT GGTGATACCC GCCTTTTGCA CCCTGCGGCC
TCGGGGAGTG CGCAGGCCTT GGCCTCCGCG CGCTTCCAGA TGAGTGATAC GGTCCTCAAG
TCAAGCTCCG CCGTGGTGAC CAGCCTGATG GGAGCAAATG ATGCAGGACG CGTAGGAGAT
GACGTTCTGA CGCAACGCGG TGCGGAGAGC TTTGCGTTGC CACAACTTTT GGCGGAGGCC
TCCGTCAGAT CTGGAGCATC CAGCTTTCGC GCCGAAACGC CCCGGCATGT AGCGCAACAA
CTTGCAGAGG CCGTCGCGAC GGGCGGCAAA CGCAATGTGG ATGTCACACT GAACCCGCGT
GAGCTGGGCC ATGTGAACAT GCGGGTGATG ACAACAGAAA TGGGCGTCAC GATCACCATC
AACGCCGAAC GTCCTGAGAC CGAGGACCTG ATGCGTCGTC ACATCCAGGA TCTTGCCCGC
GAATTCAAGG AAATGGGCTT CACCGATATC TCTTTCCAAT TTGGCTCTGA CACCGACGCC
GGTCAGTCAG GGGAGGGAGA GAGCAGTCTT GGGGGCAACG GATCCGAGCA GCAAGGCGAG
GGCGATGCCC TTGAAGCCGC TCAGTCCGGT CTTCCGATAT CACAACACTT GAACATCTCG
GCCGATGGCC TGGACATGAG GATTTAA
 
Protein sequence
MIDKILTTQS VGSTSTESSK RTEPARRGGD SFESVLAREA KTKDKTKTSP DVETPSGAEH 
DDKAFDAKNN AKATAADQHA KGEESHPEQD AGETSLVAPD ENPESEWASV IDTATQPDVE
EALDEALPEL DTSAEDFAQA DVAEGQAVAA AGATQVSSDQ SAKGSGDQVM RAGAPVTHGS
ATDKAVEVAI KGAASAEADA EMNSMEAART GLLSDKDERL GTSVLLETHR DGRRQQLVPG
TVGAIAAEKG AQPPAQAQAQ TALEALKTGA PTSPPGQEKA TEARIRDIPL TAAQAQVAAS
TATARAPMNS GDTRLLHPAA SGSAQALASA RFQMSDTVLK SSSAVVTSLM GANDAGRVGD
DVLTQRGAES FALPQLLAEA SVRSGASSFR AETPRHVAQQ LAEAVATGGK RNVDVTLNPR
ELGHVNMRVM TTEMGVTITI NAERPETEDL MRRHIQDLAR EFKEMGFTDI SFQFGSDTDA
GQSGEGESSL GGNGSEQQGE GDALEAAQSG LPISQHLNIS ADGLDMRI