Gene Clim_1852 details

Gene Information

Locus tag: Clim_1852
Symbol:
ID: 6355193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2032241
End bp: 2033434
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 57%
IMG OID: 642669456
Product: glycosyl transferase family 2
Protein accession: YP_001943870
Protein GI: 189347341
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
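
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene ends in a single stop codon; neither assumption is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2032241
END_BP = 2033434
GENE_LENGTH_BP = 1194
PROTEIN_LENGTH_AA = 397


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of three")
    return gene_length // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed lengths.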


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTACCA TGAACAGCAT CATCGCCATC ATCATCTCCC TGCTCTCGCT GCCGGCACTC 
TACCTCCTGC TGACAACCAT AGCAGCCTAC CTCTTCAAAA AAAAAGAAAA TGCGCCGAAC
AGCTTCCTCA ATATCGGCGT GCTCATCCCG GCCCATAACG AAGAAGAAGG CATCGCACGC
ACGGTCAGAA ACGTACTTGC ATGCGACTAC CCGGCAAACC GTCGCTACAT CTTCGTCATC
GCCGACAACT GCACCGACAG CACCGCCGAA ACCGCCCGCA ATGCCGGAGC CACAGTATGC
GAACGTACCG ACGAGATCAA CCGGGGCAAA GGCCAGGCGC TCGACTGGTT CCTCAGAAAA
CAGAAAAACA TTTACAAAGA CACCGATGCC ATCACCATCA TCGACGCAGA CGTAAACCCC
GACACCGGCT ATCTCCGCGA AATCAGCCTC TCCCTCAGCC AACTCGACAC ACAAGCCATC
CAGGCCTACA ACGGCGTCAG CAACCCAAAA TCCGGATGGC GTCCAGGACT TATCGACGCA
GCCTTCAACG TATTCAACCA CCTCCGCATG GCGGGCTCCT GTCAGCTCAG CGGTACCTGC
GTGCTCAAAG GCAACGGCAT GGCATTCCGC ACCGCCCTGC TCCAGCGCAC AGGATGGCCC
TGCCACTCCA TCGTCGAAGA CATGGAATTC AGTCTTCGAC TGCTCCAGGA AGAAATCGAC
GTCCACTACA ATCCCGACGC CATCATCCGC AGTGAAATGG TCACCAGCGG CAATAACGCC
ACCAGCCAAC GCAGCCGATG GGAAGGCGGA CGGTTCACCC TTGTCCGGCA AATGGCCGGC
CCACTCCTCA AGCTCTTCCT TACAACAGGC CGATCCAGCT ACCTCATCGC GCTCACCGAA
CTCGCCCTGC CGCCGCTCTC GCTGCTCGTC CTGCTCTTCG CCATCGGCAC AGCAGCCGCT
CTGCTCATCG GAAACCCGAC ACAGCAACTC ATCACACTCT CATGGTGGGC AATACTCATC
ATCTACGTGG CATCAGGACA GATCCAGCGA AAAGCTCCAC TTTCCACATG GGCCGTACTC
CTCGCCGCAC CGCTCTACAT TCTCTGGAAA ATCCCCATCT ACGCAGCCAT GCTGCTCCGA
AAAAAAAGCA CCTCCTGGGT ACGCACCACA AGAGAGCACA ACAAAACCAA CTGA
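
The gene sequence is printed in blocks of ten bases, so the spaces have to be removed before it is measured. The sketch below is one possible way to do that and to compute a GC fraction comparable to the GC content field of this record. The helper names `clean` and `gc_fraction` are hypothetical, and only the first row of the sequence is included to keep the example short.

```python
def clean(seq: str) -> str:
    """Drop the spaces and newlines that group the bases into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a DNA string."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First row of the gene sequence, copied from this record.
first_row = "ATGAGTACCA TGAACAGCAT CATCGCCATC ATCATCTCCC TGCTCTCGCT GCCGGCACTC"

if __name__ == "__main__":
    print(len(clean(first_row)))
    print(f"{gc_fraction(first_row):.0%}")
```

Passing the full 1194 bp sequence instead of `first_row` gives the value to compare against the GC content listed for the gene.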
 
Protein sequence
MSTMNSIIAI IISLLSLPAL YLLLTTIAAY LFKKKENAPN SFLNIGVLIP AHNEEEGIAR 
TVRNVLACDY PANRRYIFVI ADNCTDSTAE TARNAGATVC ERTDEINRGK GQALDWFLRK
QKNIYKDTDA ITIIDADVNP DTGYLREISL SLSQLDTQAI QAYNGVSNPK SGWRPGLIDA
AFNVFNHLRM AGSCQLSGTC VLKGNGMAFR TALLQRTGWP CHSIVEDMEF SLRLLQEEID
VHYNPDAIIR SEMVTSGNNA TSQRSRWEGG RFTLVRQMAG PLLKLFLTTG RSSYLIALTE
LALPPLSLLV LLFAIGTAAA LLIGNPTQQL ITLSWWAILI IYVASGQIQR KAPLSTWAVL
LAAPLYILWK IPIYAAMLLR KKSTSWVRTT REHNKTN
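
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The record specifies translation table 11, which, as far as codon-to-amino-acid assignments go, matches the standard genetic code and differs in which start codons are permitted. The sketch below therefore uses the standard assignments; the function name `translate` is hypothetical, and only the first and last rows of the gene sequence are included for brevity.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from this record.
FIRST_ROW = "ATGAGTACCA TGAACAGCAT CATCGCCATC ATCATCTCCC TGCTCTCGCT GCCGGCACTC"
LAST_ROW = "AAAAAAAGCA CCTCCTGGGT ACGCACCACA AGAGAGCACA ACAAAACCAA CTGA"

if __name__ == "__main__":
    print(translate(FIRST_ROW))
    print(translate(LAST_ROW))
```

Both rows start on a codon boundary, so their translations can be compared directly with the beginning and end of the protein sequence above.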