Gene TM1040_1402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1402 
Symbol 
ID4075895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1495642 
End bp1496802 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content62% 
IMG OID638006712 
Productlipid-A-disaccharide synthase 
Protein accessionYP_613397 
Protein GI99081243 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0763] Lipid A disaccharide synthetase 
TIGRFAM ID[TIGR00215] lipid-A-disaccharide synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.873415 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.256697 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGCCTCA GGGTGTTTGT CCTTGCGGGG GAGCCTTCGG GTGACCGCCT TGGCGCGGCG 
CTCATGCGGG GCCTCAAAAC GCTCGCGCCC GACGTTTCCT TTGAAGGCGT CGGCGGCAGT
CTGATGCAGA CCGAGGGGCT GAAGTCGCAA TTTCCCATGG AAGAGCTGTC CGTGATGGGG
ATTGCCGAGG TCTTGCCGAA GTATTTCGAC CTCAAGCGCC GCATTCAGGA AACCGCCGAT
GCGGTGGTGG CGATGAAGCC TGACGTAATG ATCACCATCG ACAGCCCTGA TTTTTCTCTG
CGGGTGGCAA AGTTGGTGAA AGACGCCAGC GATATTCGAA CCGTTCATTA TGTTGCGCCC
TCCGTCTGGG CGTGGCGGCC GGGGCGCGCG ACAAAGATGG CGAAGGTCAT CGATCATGTG
CTGGCACTGT TGCCGTTCGA GCCGCCTTAT ATGGAAGCCG CCGGGATGGA GTGCGATTTT
GTCGGCCATC CGGTTGTGGC TGAGCCCAAG GCGAGCGAGG CGGAAATTGC AACGTTTCGC
GCGGCGTTTG ATCTGGGCGA TGCGCCCGTT CTCTTGGCGC TGCCGGGCTC GCGGCGGTCC
GAGGTGGAGC GCCTTGCTGA TGTGTTCGGT GCAGCACTTG CACAGTTCAA AGCCAAACAC
CCCGACCACC GGATCGTTGT CCCATCCGCA TCACATGTGG CGCCTATGGT GCGCGAGGCA
CTGGCGAATT GGCCTGCGGA CAGCCTCGTG CTGGATCCGG CGGATCATGC GCCCGCAGTG
TTTGCCGCGC ACAAGCGCGC AGCCTTTGCC ACTGCCGATC TGGCGCTGGC TGCGTCTGGG
ACTGTCTCGC TCGAATTGGC CGCGGCGCGT ACACCGATGG TGATTGCCTA TCGGTTCAAC
TGGCTCACCT GGCAGATCAT GAAGCGCATG GCGCTGATTG ATACGGTGAC ATTGGTCAAT
CTGGTGAGCG ACACCCGCGT GGTGCCGGAA TGCCTTGGTC CCAATTGCAC CGCCGAAACC
ATTGCGGCGC GTCTCGATCA GGTGTCGATG GCACCCGAGG CGCAGCAAGA TGCCATGCGC
CTCACGATGG AACGGGTGGG GGAAGGCGGT GAAGCGCCGG GTCTACGTGC CGCCCGCGCA
GTTCTCGCGC GGCTCCCATA A
 
Protein sequence
MGLRVFVLAG EPSGDRLGAA LMRGLKTLAP DVSFEGVGGS LMQTEGLKSQ FPMEELSVMG 
IAEVLPKYFD LKRRIQETAD AVVAMKPDVM ITIDSPDFSL RVAKLVKDAS DIRTVHYVAP
SVWAWRPGRA TKMAKVIDHV LALLPFEPPY MEAAGMECDF VGHPVVAEPK ASEAEIATFR
AAFDLGDAPV LLALPGSRRS EVERLADVFG AALAQFKAKH PDHRIVVPSA SHVAPMVREA
LANWPADSLV LDPADHAPAV FAAHKRAAFA TADLALAASG TVSLELAAAR TPMVIAYRFN
WLTWQIMKRM ALIDTVTLVN LVSDTRVVPE CLGPNCTAET IAARLDQVSM APEAQQDAMR
LTMERVGEGG EAPGLRAARA VLARLP