Gene Clim_1850 details


Gene Information

Locus tag: Clim_1850
Symbol:
ID: 6355191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2029939
End bp: 2031027
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 57%
IMG OID: 642669454
Product: glycosyl transferase family 2
Protein accession: YP_001943868
Protein GI: 189347339
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
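
The length fields above follow from the coordinates by simple arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive (an assumption that agrees with the listed gene length) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2029939, 2031027)
print(length)                  # 1089
print(protein_length(length))  # 362
```

Both results match the Gene Length and Protein Length fields of this record.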


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACACA CCCATCAGAA CGACGGAAGC CAGGAATACC TGCCATCCAT CGACTGCGTA 
CTCATCGGAG TCAACTGCGC CAGCACCCTC AAACGATGCA TCGACTCCAT CCTGGCCTGC
GACTATCCCA AAGAAAAGCT CCGCATCATC TATGTTGACG GAGGATCAAG CGATACGAGC
AAAGCCATAG CGACGGCATA TCAAAACGTC ACGCTTATCG CGCTCGACCT CCTGCACCCG
ACTCCAGGCC TGCAGCGCAA TGCTGGATGG AAAAACGGAA CGGCCCCCTT CGTGCAATTC
CTCGACTCCG ATACCATCAT CGACCCCGCC TGGCTCCGTG CTGCGACAAC AGCCATACAA
GACCCGGCAA TCGGAGCAAT CAACGGCTAT CGCCGCGAAC TGCACCCCGA ACGCACCATC
TACAACTGGA TAGGCGACAT CGAATGGAAC GGCCCTCCAG GACAATCAGA CTGCTTCGGC
GGCGACGTAC TCATCCGGCG CACTGCACTT GAAGAAAGCG GCGGATACGA CGAAACCCTT
GTCGGAGGCG AAGACCCCGA ACTCAGCCGG AGAATTATCA GAAACGGATG GCAGATCAGG
CGCCTCTACG CCCTCATGAC CAGCCACGAC CTTGCCATGA CCACAATCAG GCAATATCTC
AAACGAGGCT TCCGATCCGG TTACGGCTTC GCTGCCGTTC GCCTGCGCGA AGCAAAAGCA
GGCAGCAGCT TCTGGAAACC GGAAAACCGC AAAATCCTCA TCAAAGGCGG CGGATTCCTC
ATCGGCGCAA CAGCGGCGCC CCTCATTGCG CTCACGCAGC ACAACGTCCG GGGAACAATC
CTCTCGCTCG CGAGCCTGCT CGGCGGCACA GCCCTGCTCC TCAACCCCAG GATATTCAAA
GTCGAAAAAT TCATGCGCGA CAACAAACTC CGCCGCGAAG AAGCAAAAAT CTACGCATGG
CACTGCTCGC TCGTCGTGCT GCCACAGCTC CTCGGCATAA TCCGATTCCA TGCCGGCCGA
CTCCTCGGAA AACCGCTCAC GAACAAACGA GCGGTACTCA AAACCGGACT CTCAACCACC
CGGACATGA
 
Protein sequence
MKHTHQNDGS QEYLPSIDCV LIGVNCASTL KRCIDSILAC DYPKEKLRII YVDGGSSDTS 
KAIATAYQNV TLIALDLLHP TPGLQRNAGW KNGTAPFVQF LDSDTIIDPA WLRAATTAIQ
DPAIGAINGY RRELHPERTI YNWIGDIEWN GPPGQSDCFG GDVLIRRTAL EESGGYDETL
VGGEDPELSR RIIRNGWQIR RLYALMTSHD LAMTTIRQYL KRGFRSGYGF AAVRLREAKA
GSSFWKPENR KILIKGGGFL IGATAAPLIA LTQHNVRGTI LSLASLLGGT ALLLNPRIFK
VEKFMRDNKL RREEAKIYAW HCSLVVLPQL LGIIRFHAGR LLGKPLTNKR AVLKTGLSTT
RT
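
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a self-contained illustration in Python, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; table 11 differs in which codons may act as start codons, which does not matter here because this gene begins with ATG. The name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon.

    Alternative start codons are not handled: the first codon is translated
    like any other, which is only correct for ATG-initiated genes.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGAAACACA CCCATCAGAA CGACGGAAGC"))  # MKHTHQNDGS
print(translate("CGGACATGA"))                         # RT
```

The two fragments reproduce the first ten and last two residues of the protein sequence listed above; passing the full gene sequence, with its spaces and line breaks, works the same way.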