Gene TM1040_3712 details

Gene Information

Locus tag: TM1040_3712
Symbol:
ID: 4075419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 770332
End bp: 771564
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 62%
IMG OID: 638005232
Product: hypothetical protein
Protein accession: YP_611941
Protein GI: 99078683
COG category: [S] Function unknown
COG ID: [COG5441] Uncharacterized conserved protein
TIGRFAM ID:
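
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. Neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(770332, 771564)
residues = protein_length_aa(length)
print(length, residues)
```

Under these assumptions the computed values agree with the reported Gene Length (1233 bp) and Protein Length (410 aa).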


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0523031
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGGTG AAAAGACGAT CCTTGTGGCC GGGACCTGGG ATACCAAGGA TGATGAGCTG 
TCTTATTTAT CGGAAGTGAT CCGGGGGCAG GGCGGTCAGG TGCTCAGCAT GGATGTGAGT
GTGTTGGGCG AGCCCAAACT GCCCACGGAT GTCTCAAAAC ACGACGTTGC CGAGGCGGCG
GGCAGTTCCA TTCAGAGGGC CATCGACAGC GGAGATGAAA ATACCGCGAT GCAAATCATG
GGGGCGGGCT CGGCCAGGCT TGCGCTGGAT CTGTGGCGCG CGGGGCGCAT CCATGGCGTG
ATCGTGCTCG GGGGCACCAT GGGCACCGAT CTTGCGCTCG ACCTCTGTGC TGCGTTGCCT
TTGGGGGTGC CCAAATATGT CGTCTCGACC GTGGCATTCT CGCCGTTGCT GCCACCGGAG
CGCATCCCGG CGGATCTGCA GATGATCCTT TGGGCCGGGG GGCTCTATGG ATTGAACGAC
ATCTGCAAAG CATCGCTCAG TCAGGCTGCG GGTGCCGTTC TGGGCGCCGC GCGCGCGGTG
GAGGCGCCCA GTTTTGAGCG TCCGATGGTG GGCATGACCT CCTTTGGAAA GACGGTGCTG
CGCTACATGG TGACGCTTAA ACCAGAGCTT GAGAAGCGCG GATTTGATGT GGCGGTCTTT
CATGCCACCG GCATGGGCGG GCGCGCCTTT GAGAGCCTTG CGGGGGAGGG CGCTTTTGCG
GCGGTGATGG ATTTTGCCCC TCAAGAAGTG AGCAATCATC TCTTTGGCGG CTTGTCGGCG
GGCGAGGGGC GCATGACACA CGCCGGGCAT GCGGGTGTCC CGCAACTGAT TGCGCCGGGA
TGCTATGACC TTGTGGATTT TGTCGGCTGG CAGGGTGCGC CGGAGCAACT GCGCGGACGG
GAGTGCCACG CCCATAACCG CTTGCTGACG TCGGCCATGC TTGATGCGCG AGAACGACAG
CGCGTCGCGC AAGAGATGTG CAACAAGCTT GCCCGGGCCT CAGCACCAGT CACGGTGTTC
TTGCCCCGCG CGGGCTGCAA CGAATGGGAC CGCGCCGGCG GCGATCTGCA TGATGCGGAA
GGGCTTCGGG CCTTTTGCGA TGAGATGCGT CGCGGAGTTC CGGAGAACGC GCAACTGCAG
GAGCTCGACT GCCACATCAA TGACGCCGAA TTCACCAATG CGGTGCTGGC ACAGTTTGAT
GCCTGGATCA AAGAGGGCGT GATCGTGCGC TGA
 
Protein sequence
MNGEKTILVA GTWDTKDDEL SYLSEVIRGQ GGQVLSMDVS VLGEPKLPTD VSKHDVAEAA 
GSSIQRAIDS GDENTAMQIM GAGSARLALD LWRAGRIHGV IVLGGTMGTD LALDLCAALP
LGVPKYVVST VAFSPLLPPE RIPADLQMIL WAGGLYGLND ICKASLSQAA GAVLGAARAV
EAPSFERPMV GMTSFGKTVL RYMVTLKPEL EKRGFDVAVF HATGMGGRAF ESLAGEGAFA
AVMDFAPQEV SNHLFGGLSA GEGRMTHAGH AGVPQLIAPG CYDLVDFVGW QGAPEQLRGR
ECHAHNRLLT SAMLDARERQ RVAQEMCNKL ARASAPVTVF LPRAGCNEWD RAGGDLHDAE
GLRAFCDEMR RGVPENAQLQ ELDCHINDAE FTNAVLAQFD AWIKEGVIVR
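
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own method. It assumes the standard codon assignments, which translation table 11 shares for elongation codons, and it does not handle alternative start codons (not needed here, since this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# NCBI-style codon ordering: bases in the order T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above (spaces are stripped).
print(translate("ATGAACGGTG AAAAGACGAT"))
```

Passing the full gene sequence, with or without the block spacing, should give a string that can be compared against the protein sequence listed above once its spaces are removed.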