Gene TM1040_3854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3854 
Symbol 
ID4074917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008042 
Strand
Start bp106169 
End bp107140 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content65% 
IMG OID638004511 
ProductKpsF/GutQ family protein 
Protein accessionYP_611246 
Protein GI99077987 
COG category[M] Cell wall/membrane/envelope biogenesis
[T] Signal transduction mechanisms 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation
[COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCGC GAGAACTGAC CTGTGATGAA ACCCTTGCCG AAATGGCCCG CGTGCTGACC 
GTGGAGGCCG CCGCGCTGAC TCAGATGGCT TCTGAGGTGG GCGATCCACA GCTCAAGGCG
GTCGAGATTC TCGAGGCCAT GGAAGGCCGT GTGATCGTGT CGGGCGTCGG CAAATCCGGC
CATATCGGCA ATAAGATCGC CGCCACCCTG GCCTCGACGG GGACGCCTGC GCAATTTGTG
CATGCCACCG AGGCGAGCCA CGGCGATCTT GGCATGGTGA CGCCGCGCGA TGTCTGTCTG
GTGATCTCCA ATTCCGGCGA AACCTCCGAG CTGGCCGATA TCGTCACCTA TAGCCGCCGC
TTTGCTATTC CGCTCATTGC CATCACCCGC AAGGCCGACA GCACCCTCGC GACCCAGGCC
GATGTGGTGC TGCTGCTGCC CGATGCGCCC GAGGCCTGCG GCATCGGCAT GGCCCCCACC
ACCTCGACCA CGGCAACGCT GGCGATGGGG GATGCGCTGG CGGTGGCCCT GATGAAACGG
CGCGGCTTTG AGCGCGAGGA TTTCAAGGTC TTCCACCCCG GCGGCAAGCT CGGCGCGCAG
CTGATGCTGG TGGATGGGCT GATGCACACG GGCGAGGCGC TGCCGCTGGT GGCGCCAGAG
ACACCGATGA CAGAGGCGCT TTTGATCATG ACCGCCAAGG GCTTTGGCCT TGCGGGGCTG
GTCGAAGGTG GCCGCCTCAC GGGCATCATC ACCGACGGCG ATTTGCGCCG CAATATGGAT
GGTCTGATGG CGCGCAGCGC CGGCGAGGTG GCCACCCGCG GCCCCAAGGT GATCCGGCGC
GGTTCGCTGG CCTCCGAGGC GCTCCACGAC ATGAACAGCC GCAAGATCTC GGCGCTGTTT
GTGCTCGATA ATGAGGACCG GGTGGCGGGC TTGCTGCATA TCCATGACTG CCTGCGGGCT
GGGTTGGCTT GA
 
Protein sequence
MTPRELTCDE TLAEMARVLT VEAAALTQMA SEVGDPQLKA VEILEAMEGR VIVSGVGKSG 
HIGNKIAATL ASTGTPAQFV HATEASHGDL GMVTPRDVCL VISNSGETSE LADIVTYSRR
FAIPLIAITR KADSTLATQA DVVLLLPDAP EACGIGMAPT TSTTATLAMG DALAVALMKR
RGFEREDFKV FHPGGKLGAQ LMLVDGLMHT GEALPLVAPE TPMTEALLIM TAKGFGLAGL
VEGGRLTGII TDGDLRRNMD GLMARSAGEV ATRGPKVIRR GSLASEALHD MNSRKISALF
VLDNEDRVAG LLHIHDCLRA GLA