Gene TM1040_3688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3688 
Symbol 
ID4075657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp748946 
End bp750166 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content63% 
IMG OID638005208 
Producthypothetical protein 
Protein accessionYP_611917 
Protein GI99078659 
COG category[R] General function prediction only 
COG ID[COG4671] Predicted glycosyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.112899 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCCA GCCCCCTTCC CCGTTTCGGC CCTTCGGACC GTGGCCCCCG CATCCTGTTT 
TACAGTCACG ACACGTTTGG CCTTGGTCAC TTGCGGCGCT CGCGCGCCCT GGCAGCGGCG
ATCACTTCGG CGGACCCCAA AGCCTCTGCA ATGATCCTGA CCGGCTCGCC AGTGGCGGGG
CGATTTGCCT TTCCCAATCG CGTGGACCAC ATGCGCCTGC CCGGTGTGAT CAAGCGCGCT
GACGGCTCAT ATGCCAGCCG CACAATGGGT ATGAGCATCG AAGAGACCAC GGAGCTGCGC
GCGGGCCTCA TCCGCTCCAC CGCCGAGCAG TTTGCGCCCG ATATACTGGT GGTCGACAAG
GAACCCACCG GATTTCGCGG CGAGTTGATC CCCACGCTTG ATCTCTTGCA GGAACGCGGC
CAGGCGCGCC TTGTGCTGGG TTTGCGCGAT GTTCTGGACG AACCAGAAGT GCTGCGCGCC
GAATGGGAGC GCAAATCCGC GCTCGAGGCG GCAGAGACCT ACTATGATGA AATCTGGATC
TACGGTTTGC ATGACGTCTA TGATCCGACC GCAGGCCTGC CCCTGAGCAA AGAGACACAG
GCGCGCATGC ACTGGACGGG CTATCTGCGC CGCGATCTCG GCGAAGTGGG CGAGCCTCCA
GAGCAGCCTT ATGTTCTGAT TACCGCCGGC GGTGGCGGCG ATGGCGCGAT GATGGTGGAT
CTTGCGATTT CCGCCTATGA ACGCGATCCC ACTCTCACGC CACGCGCGAT GCTGGTCTAC
GGCCCGTTCC TGTCCGGTGA CACCCGCGCC GCATTTGAGG ATCGGGTCGC CGCCCTTGAC
GGGCGGGTCA GCGCCGTCGG CTTTGAGAGC CAGATCGAGA CGCTGTTTGC CGGAGCGCAG
GGCGTCATCT GCATGGGCGG TTACAACACG TTCTGCGAGG TGCTTTCGTT TGACAAACCG
GCCGTGATTG TGCCGCGTAC CACGCCCCGG CTGGAGCAGT GGATCCGGGC CAGCCGTGCC
GAGGAACTGG GCCTCGTGAC CATGCTCGAC GAAACCCGCG ATGGCTGGAC GCCCGAGGCG
ATGATCGGTG CGATCCGCGC GCTGGAGCGC CAGCCTAACC CCTCAAAAGC GATCTCTGAC
GGGCTACTTG ACGGACTCGA CTATGTGACC GAACGGGTCA ATGCACTGTT GCAACAACTC
CCGCGTGAGG TCAGCGCATG A
 
Protein sequence
MSASPLPRFG PSDRGPRILF YSHDTFGLGH LRRSRALAAA ITSADPKASA MILTGSPVAG 
RFAFPNRVDH MRLPGVIKRA DGSYASRTMG MSIEETTELR AGLIRSTAEQ FAPDILVVDK
EPTGFRGELI PTLDLLQERG QARLVLGLRD VLDEPEVLRA EWERKSALEA AETYYDEIWI
YGLHDVYDPT AGLPLSKETQ ARMHWTGYLR RDLGEVGEPP EQPYVLITAG GGGDGAMMVD
LAISAYERDP TLTPRAMLVY GPFLSGDTRA AFEDRVAALD GRVSAVGFES QIETLFAGAQ
GVICMGGYNT FCEVLSFDKP AVIVPRTTPR LEQWIRASRA EELGLVTMLD ETRDGWTPEA
MIGAIRALER QPNPSKAISD GLLDGLDYVT ERVNALLQQL PREVSA