Gene TM1040_0076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0076 
Symbol 
ID4075973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp78842 
End bp79999 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content57% 
IMG OID638005363 
Productlipopolysaccharide biosynthesis protein-like 
Protein accessionYP_612071 
Protein GI99079917 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3754] Lipopolysaccharide biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.39731 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTGC CTCTATGGAA GATCAAACGC GAGCTGAAGC GTCAAGGCAG GCAGCTAAAG 
AACATTGGGC CTCGTCTTGC GAGCCTTTTG TTTTCGCGGA CCTATTATGA TCTATTCCTT
TCTGGAAAGA AGACCGTCAC CGAAGGTCAG ATCCCCAAGC GTCCAAAGGT CGCGGTCTAC
CTGATCTTTC CAAGCTCCGG CATTCTCGGG TCTCATATCG AGGCGCTGCG GTATCTTGCA
CAGAACGGAT ACGCGCCCGT CGTGGTCTCA AACCTGCCCC TTGATGAGGG CGACCTTGAA
CACCTGCGCG CTCATGCGCA TCTCGTCATC CAGCGTCCGA ACTATGGCTA TGATTTTGGC
GGCTACCGCG ATGGTGTTCT TGAGGTCGCC GCGCGTCACA CAAGCGTGGA GCGTTTTGTC
CTGCTCAACG ATTCTGCCTG GTTCCCCTTG CCTGGGTCGC GCAATTGGCT GGCCGATGCC
GAAGCGCTGG AGCTGGATTT CGCGGGTGCG GCGACGAACT ATGGCCACAC GCGGGTCGAT
CCGAAGAATT TCCGGGACAT CCGCTGGCAC TATTCCAGCA ATCACCGCAA CTTTCACTAC
TGCTCCTTCG CTTTGATGAT GAGCGGGAAG CTGTTCAACG ATAAGCGGTT TCAGCGCTTC
TGGAAGAGCT TTCCGCTCAC CAATGACAAG ACCGTCACGG TAAGGCGGGG CGAGATTGGC
CTCACGAAAT GGGTCATCCA GCAAGGTTTC TCGCATGGCT CTACGCTCGA TATTGCCGCG
CTTGATCAAA GGCTGGCTGA ATATGGCATA GATGAGCTGC GCGCGATTGC CGCGCAGACC
CTGATGCCGC AGAGCCCCTC GATGAAAGAG GTCCTGGAAG ACACCGTCCG CTCGGCAGAG
AGCAAAGAAG ATTTGGTCAA CGTGATCCTG ACTGCGATCG CGAGGAAAGG CATCAGCTAT
GCGCAACCTC GTCTGATCCA CCGTGACTAC GGGTTCGCGT TCCTCAAGAA ATCGCCGCTT
TGGCTGGATG AGGATGCCTC AAACCTGACC CTCGCCTTCA CCCGCGACCT TGATGGAGAA
TTTGGCAAAG TCCTGCAGGC TGAGGCATTG GACCTGCGCC GGACAAGAGC GGCGGAATTT
GCACCCGCCC CGGACTGA
 
Protein sequence
MSLPLWKIKR ELKRQGRQLK NIGPRLASLL FSRTYYDLFL SGKKTVTEGQ IPKRPKVAVY 
LIFPSSGILG SHIEALRYLA QNGYAPVVVS NLPLDEGDLE HLRAHAHLVI QRPNYGYDFG
GYRDGVLEVA ARHTSVERFV LLNDSAWFPL PGSRNWLADA EALELDFAGA ATNYGHTRVD
PKNFRDIRWH YSSNHRNFHY CSFALMMSGK LFNDKRFQRF WKSFPLTNDK TVTVRRGEIG
LTKWVIQQGF SHGSTLDIAA LDQRLAEYGI DELRAIAAQT LMPQSPSMKE VLEDTVRSAE
SKEDLVNVIL TAIARKGISY AQPRLIHRDY GFAFLKKSPL WLDEDASNLT LAFTRDLDGE
FGKVLQAEAL DLRRTRAAEF APAPD