Gene Clim_0914 details

Gene Information

Locus tag: Clim_0914
Symbol:
ID: 6354151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 996661
End bp: 997968
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 58%
IMG OID: 642668541
Product: glycosyl transferase family 2
Protein accession: YP_001942972
Protein GI: 189346443
COG category: [M] Cell wall/membrane/envelope biogenesis
              [S] Function unknown
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
        [COG3222] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
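
The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal Python check, not part of the database itself. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(996661, 997968)
print(length, protein_length(length))  # prints: 1308 435
```

Under those assumptions the result agrees with the Gene Length (1308 bp) and Protein Length (435 aa) fields.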


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.262835
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCAGG AATCATCTTC CGGGCGCATG CTCATCGTCT TTACCCGCAA CCCGGTGCTT 
GGCAGGGTAA AGACGCGGCT CGGCGCTGAA ACAGGCCCGG AAACCGCGCT CAGGGTCTAC
CGCATGCTCA GGGAGCTGAC CGCCTCGGTT ACGGAGGCGT GCAGCGCCGA GCGCGCGGCG
TTCTACTCGG ATGAGATTCC GGATGCAGAC TGCTTTCTGA GAGGCGGAAC GCTCGCCTTT
CTTCAGGAAG GCAGCGATCT CGGCGAACGG ATGCTTCATG CCTTTGAAAC CGGCTTTGCC
GGGGGCTTCG GGCACATCGC GCTCATCGGC ACGGACTGTC CGGATCTGCA GACCAGCATA
CTCGAACAAG CCTTTACCGA GCTCGAAAAC CATGATGCCG TGCTCGGGCC GGCAAAGGAT
GGCGGGTTCT ACCTTATCGG ACTGAACAAA AGCCATCCCG AGCTCTTTCT CGACCGATCC
TGGAGCCACA GCCGCGTCCT GCAGGAGACC ATCGACAGGC TGAACGAATA CGAAACAACG
TTCGGCCTGC TCCCGGAGCT GCAGGACATC GACACCCTGG AGGATCTCAG GCAAAGCCGA
CTCCGGAGTG CTGAAATGAT GGGTTCATTG AGCATTATAA TCCCGACCTT TAACGAGGAA
ACCGGCATCG CCCGGACGCT GGATACCCTG CTCGCCCTTA CCGGAAGATA TGACGACGTA
GAGATCATTG TCAGCGACTC GGGCACCGAC CGTACAGCCG AAATCGTCTC GGCCTTCCCC
GTCACGCTCT GCCGGTCGGA AAAAGGACGC GCCCGACAGA TGAACGCGGG AGCCAAGCTC
GCCAGACACC ATACGCTCTA CTTTCTGCAC GCGGACACCC TGCCGCCCGA ACGATTTGTC
GATGACATTG TCGATGCGGT CGGAAGCGGA AAAGAGGCGG GATGCTTTCG GATGCAGTTC
GACGACCCGC ACCCCATCAT GACCCTCTTC GGCTGGTTCA CCAGAGTTCC CCTTTCGATC
TGCCGGGGCG GCGACCAGTC GCTCTTCATA ACAAAAGAGC TGTTCGACGC TCTCGGCGGG
TTCGACGAAA GGATGCAGGT GATGGAGGAT ATCGACATCA TCGAGCGCAT CGAGCGCCGG
GGAACCTTTC ACATCCTCGA CAACCACGTC GTGACTTCGG CAAGGAAATA CCATAAAAAT
GGCATCCTGC GTCTGCAGGC GATCTTCGGC ACCATCCATC TGATGTATGC GCTGGGGTAT
GATCAGGAGA GCATTATCCG TTACTACCAG GAAAATATCG AATCGTAA
 
Protein sequence
MKQESSSGRM LIVFTRNPVL GRVKTRLGAE TGPETALRVY RMLRELTASV TEACSAERAA 
FYSDEIPDAD CFLRGGTLAF LQEGSDLGER MLHAFETGFA GGFGHIALIG TDCPDLQTSI
LEQAFTELEN HDAVLGPAKD GGFYLIGLNK SHPELFLDRS WSHSRVLQET IDRLNEYETT
FGLLPELQDI DTLEDLRQSR LRSAEMMGSL SIIIPTFNEE TGIARTLDTL LALTGRYDDV
EIIVSDSGTD RTAEIVSAFP VTLCRSEKGR ARQMNAGAKL ARHHTLYFLH ADTLPPERFV
DDIVDAVGSG KEAGCFRMQF DDPHPIMTLF GWFTRVPLSI CRGGDQSLFI TKELFDALGG
FDERMQVMED IDIIERIERR GTFHILDNHV VTSARKYHKN GILRLQAIFG TIHLMYALGY
DQESIIRYYQ ENIES