Gene TM1040_3492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3492 
Symbol 
ID4075132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp524505 
End bp526412 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content64% 
IMG OID638005007 
ProductBeta-galactosidase 
Protein accessionYP_611726 
Protein GI99078468 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1874] Beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGAA CCCTTGGCAC CTGCTATTAC CCCGAGCACT GGCCAGAAGA GATCTGGGCC 
GAAGACGCCG CGCGTATGAA AGCTGCGGGC CTGACGTGGA TCCGGATTGG GGAATTTTCG
TGGTCGAAAC TCGAACCCAC TCCCGGCGAT CTGCACTGGG ACTGGCTCGA CCGCGCGATC
GAGACATTGG GGGCGCAAGG CTTGCGGGTG GTGCTCGGCA CCCCCACAGC GACCCCGCCG
CGCTGGATGG CCGAGCGTCA CCCGGACATG TTTGCTGTCA CCGCCGAGGG CCAGCCACGC
GGCTTTGGCT CCCGGCGGCA CTATTGCTTT AGCCACAAGG GTTATTTTGC CGAGAGCCAG
CGCATCACCC GCCTGATGGC AGAGCGCTAT GGTGCCAACC CGCATGTGGC CGCGTGGCAG
ACCGACAATG AATATGGCTG CCATGATACC GTGATCAGCT ATTCCGACGC GGCTCAAACC
GCTTTTCGGG CGTGGCTGGC AGAGCAGTTC GACGGCGAGA TCGGCGCGCT CAACGCGGCC
TGGGGCAATA TGTTCTGGTC CATGGAGTAT CGCAGCTTTG ACGAAATCGG CCTGCCCAAC
CTCACCGTGA CGGAGCCGAA CCCGGCGCAT GTGCTGGCGT TCAGACGCTT CAGCTCCGAT
CAGGTGGTGG CTTTCAACCG CGCGCAGGTC GAGATCATCA AGGCCCATTC AACCGCGCCG
ATTTCTCATA ACTACATGGG GCGGATCACC GATTTCGACC ACTTCAAACT AGGTGAGGAT
CTCGAGATCG CGACCTGGGA CAGCTACCCG CTGGGCTTTC TGGAAGACCG CGTGGGGGCC
TCACCCGAGG AACAGCGCGC TTATGCCCGG CAGGGGGATC CGGATTTTCA GGCCCTTCAT
CACGATCTCT ATCGCGCGGT TGGGCGCGGG CGCTGGTGGG TCATGGAACA GCAGCCGGGG
CCAGTGAACT GGGCGCCCTA CAACCCGGCA CCCCTGCCGG GCATGGTGCG GCTCTGGACC
TGGGAGGCCT TTGCCCATGG CGCCGAGGCT GTGTGTTATT TCCGCTGGCG GCAGGCGCCT
TTTGCGCAGG AACAGATGCA CGCAGGCCTC TTGCGTCCCG ACAGCCAGGA CGCCCCCGCC
ATGCAAGAAG CGATGGATGT TGCGGCAGAG CTTGGCGCGG CAGCCGATGT GCAGCCCGCG
CAGGCACCGG TGGCGATCCT TTTTGATTAC GATGCCGATT GGGCGTGGTC GACGCAGCCG
CATGGTGCAG GGCTGAGCTA TTTCCAGCTG ATCCTCGAAC ACTACAAGGC GCTTCGGCGC
GCTGGTCAGA CCATCGACAT CCTGCCCCCG GAGACCCGCG ATTTCACGGG GTACAAGATG
ATCCTTGCGC CCGGGATGAT GCATCTGCCA GAGCCCCTCA AAGAAGCGCT CGCAAGAAGT
GAGGCCGAGG TGCTCTATGG TCCGCGCAGC GGTGCGCGTG ACGGTCATTT CTCCATCCCG
ACCAGCCCAC TGCCACCTGC ATTGCCGGGG CTGGACGTGA CCGTGGCACG GGTGGAGAGT
CTGCGCCCGG ATATGCCCAT CGCCCTTAAG GGCGGTGGTG CAGTGCGCGG CTATCTTGAG
GAGCTTGAAG GCACTGCAGA AGTGGTCTTT GAAACCTCCG AGGGCGCGGC GGTCGCGCTC
CGGGCCGGGC GACAGACTTA TTGTGGCGGC TGGCTCGATG CAGAGGGGCT TGATCGGTTG
ATTGCCACCG TTGCGCAGGC GGCGGGTCTG GAGTTGCGCC AGATGCCGGA AGGGGTGCGC
ACCCGCCGCA CGGCAACCGA GGTCTTCTGG TTCAACCACA GCGCAGAGCC TGTCGAAACC
GAAGTTGGCC TCTTGCCTCC GGCGGGGGTG AAACGGATCG CGCTTTAG
 
Protein sequence
MKRTLGTCYY PEHWPEEIWA EDAARMKAAG LTWIRIGEFS WSKLEPTPGD LHWDWLDRAI 
ETLGAQGLRV VLGTPTATPP RWMAERHPDM FAVTAEGQPR GFGSRRHYCF SHKGYFAESQ
RITRLMAERY GANPHVAAWQ TDNEYGCHDT VISYSDAAQT AFRAWLAEQF DGEIGALNAA
WGNMFWSMEY RSFDEIGLPN LTVTEPNPAH VLAFRRFSSD QVVAFNRAQV EIIKAHSTAP
ISHNYMGRIT DFDHFKLGED LEIATWDSYP LGFLEDRVGA SPEEQRAYAR QGDPDFQALH
HDLYRAVGRG RWWVMEQQPG PVNWAPYNPA PLPGMVRLWT WEAFAHGAEA VCYFRWRQAP
FAQEQMHAGL LRPDSQDAPA MQEAMDVAAE LGAAADVQPA QAPVAILFDY DADWAWSTQP
HGAGLSYFQL ILEHYKALRR AGQTIDILPP ETRDFTGYKM ILAPGMMHLP EPLKEALARS
EAEVLYGPRS GARDGHFSIP TSPLPPALPG LDVTVARVES LRPDMPIALK GGGAVRGYLE
ELEGTAEVVF ETSEGAAVAL RAGRQTYCGG WLDAEGLDRL IATVAQAAGL ELRQMPEGVR
TRRTATEVFW FNHSAEPVET EVGLLPPAGV KRIAL