Gene TM1040_3844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3844 
Symbol 
ID4074907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008042 
Strand
Start bp92309 
End bp93592 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content49% 
IMG OID638004501 
Producthypothetical protein 
Protein accessionYP_611236 
Protein GI99077977 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.429225 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAAAA GCGCGAAGCT CCGCATCCAC TGGATTTCAC CAGTCCCGCC GGCTGAAACC 
GATATTGCAC ATTATACTCA CAGAATCTTG CCGGAGCTGG CTGAGGTTGC TGAGCTTGTC
TTATGGACCG ATGCAGCGGA ATGGGACTGG AACCTACATA ATATTGCACC GGTCCGCCAT
TTGGATCCTG ATAGAGTTCG CCCCCGAGAT TTCGCTCTTT CGGGCCGCGC TGGACGTGGA
CCAGAGCCTG GACCGGAGGT GGTCTTTGTA AATATAGGAA ATGCTTGGCC ATTTCACGCG
GGCTTTTTGC GAATGATACA GCGCTTACCT ACGATTGTAG TGTTGCATGA CATGGCTATC
CAAGAGCTGT GTTTTGATGC GATGGAGCGC AGTCTTCTAG ATCAAGATGT ATACCTTGAG
AATATGAGGA GGTCGCATGG CCAAGCGGGG GTGGAGGCCG CCAAACTCGT TCTGGGGGAT
GAAATGGAAG CCGGTGCTCT TGCTCGTACA TATCCCGGGT TTGAGCTGGC CTTAACGCAT
GCTGTTTCAG TTGTAACACA TAGTCGCGTT GCTCAGCAGA GCGTAGATAC CGTATCAGAC
ACTCCTACAT ATTTATTGGA CTTGCCGTTT CGCCCCTCCT TGAGGCGGCC AGAGGTTTAT
CGTTCTGGAT TAGGGCCTCT GCGCCTTGTC CAATTTGGCT ATACCGGACC TAATCGGCGA
CTTGAGAGTG TTCTAAGAAC TTTGGCTTCT TTGAAACATG AAGTCGATTT CCGTTTTGAT
GTAATGGGAA AGCTATGGGA TACTGGGTAC CTTCGCAATT TGGTCAATCA GTTAGGAATT
GAAGCTCGAG TAACCTTCCA CGGCTTTGTG GACGAAAGTT TTCTCGATCA AAAACTCCAG
CAAGCTCACT TGGTGTTTAA TCTAAGATCA CCAACCATGG GAGAGGCCTC TGGCAGCCAG
TTGCGTATTT GGAACGCTTG TGCTGCCTCC GTAGTAACCA ACTTAGGATG GTATGCGGAC
CTACCCGACG AAACAGTTTT TAAGATTAAT CCTGATCGGG AAGCTGAGGA GCTTAAATCT
ATAGTTCGCA GCTTGTATGC AGATCCGTCT CGCGGAAACC GTAAAGCTCT AGCTGGGCGG
GAGCGTCTGG AGCAAATACA TAACCCAGCT CGCTATGCTG CTGCCATTAA AAAGGTAGCT
CAAAACTTCT CTCAAGACGC AGCACACAGC TTGCTAGGTT CTAGATGTTC ATTTCCGAAA
TGTAGAACAC TGGATCAACG ATGA
 
Protein sequence
MVKSAKLRIH WISPVPPAET DIAHYTHRIL PELAEVAELV LWTDAAEWDW NLHNIAPVRH 
LDPDRVRPRD FALSGRAGRG PEPGPEVVFV NIGNAWPFHA GFLRMIQRLP TIVVLHDMAI
QELCFDAMER SLLDQDVYLE NMRRSHGQAG VEAAKLVLGD EMEAGALART YPGFELALTH
AVSVVTHSRV AQQSVDTVSD TPTYLLDLPF RPSLRRPEVY RSGLGPLRLV QFGYTGPNRR
LESVLRTLAS LKHEVDFRFD VMGKLWDTGY LRNLVNQLGI EARVTFHGFV DESFLDQKLQ
QAHLVFNLRS PTMGEASGSQ LRIWNACAAS VVTNLGWYAD LPDETVFKIN PDREAEELKS
IVRSLYADPS RGNRKALAGR ERLEQIHNPA RYAAAIKKVA QNFSQDAAHS LLGSRCSFPK
CRTLDQR