Gene Clim_1834 details


Gene Information

Locus tag: Clim_1834
Symbol:
ID: 6355175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2012750
End bp: 2014516
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 55%
IMG OID: 642669438
Product: glycosyl transferase, WecB/TagA/CpsF family
Protein accession: YP_001943852
Protein GI: 189347323
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1922] Teichoic acid biosynthesis proteins
TIGRFAM ID: [TIGR00696] bacterial polymer biosynthesis proteins, WecB/TagA/CpsF family
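
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal Python sketch of that consistency check. It assumes 1-based inclusive coordinates and a CDS that ends in a stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2012750, 2014516)
print(length, protein_length(length))  # 1767 588
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.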


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATCACA CGGCAGGCAT TGTGCTCGAA AACCGTTCGG GATCTGCAGA AAGAGCGAAA 
GCCATCATCT TCGGCAGAAA AGAACAAGTC GGCGACATCT GCAAACTGAT CGAAAACAGT
TACAGCCAGG TCACCCTTGC CGAAACATCC CGTGAACTCG AAAAATCAGG GCCCGACAAA
GAACAATTCA GGCTTGCAGT AATGACCGAC AGTTTTTCCG AAAAACTCAG CATAGGGCTC
CTCAACCGCG TCAGACAGAA ATTCGACCCT GAAAACATGA TCTGCCTCTC CGGAGAAATA
GAAGAAGACA ACGAAATAAC GCTGCGCTCC GCAGGGCTGA TCTTCCTGGG CAGCTACGGA
ACATTTTTCG CCCATGCCGA CAACATCATC AAACACACCC TGAACAACGG CGGCAACAAC
GCAATGAAAC CGGAAATCGA GACCGCAAGA AATCTGGAAA AACGCCTCAA AAGAGCGCGA
GGCTCAGGAA GAAGCCGAAG AAAACGGTTA TCATTCTTAA TCACATCCTC CATAGCGGAA
ACAGCCGCGA GGAGCATCGA ACTCCTGACA GCACTTGCCG TCACCATCAC GCTCTTCATT
CCCGTACTGC TTATCCGGCT CCTTATACGC ATCCCATGCG GTCAACCGGT CTTTTCCAGG
CGAACGGTCT GCGGCATGGC CGGCCAACCC ATCACTATCC GTACCTTCAG CGACCTCAAG
GGACGAATGG CCGATCTTCC GCTCTTCCTC GAACTCTTTA CCGGACGCCT TGCCCTTGCA
GGTACCGCAA TCAGAGAGTG GGACGCTCCC GACCCCAATG CCGAACAAGC CTACATCAGC
ATGGTCAAAC CCGGCATCAT ATCACTCTGG GACATCCGCC GTACCAGCAA AATCGCGCAC
GAAGGACGCG AAGCCATCGA ATGGGAATAT ATCTTCAGCA AACGCCCGGC CTATGACCTG
CTGCTTCTGC TCAGAGCACT GCCTGCAATG CTCTACAGCG AAACGACCTC CACATACGAT
CCGGTATTCA GGCTGCTCGG ACTTGACATC GACAACATCA CCATGGCTGA AGCGGTCTCG
CTCATACAGA CCGACCTCCG CGACAACCGG CAGCAAGCCA TCTATTTCGT CAATCCAGAC
TGCCTGAACA AAATGGCCGG AGACAGGGAG TACTGCGAAG TCCTGAAAGA CGGCGACAGC
ATATTCCCCG ACGGCATCGG CCTCACCATT GCCGGAAAAC TCCTGCAGAG CCCCCTCAAA
GAAAACATCA ACGGCACAGA CATGCTCCCC TATCTCTGCA GGATGGCGGC AGCCGAACGA
CACAGCATAT ACCTGCTCGG CGGCAAACCC GGCATAGCCG ACAAAGCCGC AAGCAAAATC
AACCGCGAAT TCGGCGTCAC CATCGCAGGC ACCGCCGACG GCTACTTCAA CCACGAAACC
GAAACAGGCC GCATCATCGA CGATATAAAC CGCTCCGGAG CCTCCATCCT GCTCGTAGCA
TTCGGAGCCC CGCTGCAGGA AAAATGGATC CACCGCCACC GAAACCGGCT CCAACCCGCG
CTCCTCATGG GTGTAGGCGG ACTCTTCGAC TTCTACTCGG GCAACGTTCG TCGCGCCCCT
CGCTGGATGC GTGAAATCGG CATCGAATGG ATATACAGGA TCATGCAGGA ACCCGGACGG
ATGTGGCGTC GCTACGTCAT AGGCAACCCG CTCTTCCTCT ATCGCGTCAT GAAATGGAAA
CTCCTAACCG GCAGCGGCAA CCACTGA
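
The GC content field reported for this gene can in principle be recomputed from the gene sequence. A minimal Python sketch follows, assuming GC content means the fraction of G and C among unambiguous bases; how the database computes or rounds its own figure is not stated in the record. The function name `gc_percent` is hypothetical, and the example input is only the first 20 bases of the sequence, not the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace (such as the 10-base grouping used in this record) and any
    other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base groups of the gene sequence, for illustration only.
print(gc_percent("ATGTATCACA CGGCAGGCAT"))  # 50.0
```

To check the reported figure, the full 1767 bp sequence would be passed in as one string.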
 
Protein sequence
MYHTAGIVLE NRSGSAERAK AIIFGRKEQV GDICKLIENS YSQVTLAETS RELEKSGPDK 
EQFRLAVMTD SFSEKLSIGL LNRVRQKFDP ENMICLSGEI EEDNEITLRS AGLIFLGSYG
TFFAHADNII KHTLNNGGNN AMKPEIETAR NLEKRLKRAR GSGRSRRKRL SFLITSSIAE
TAARSIELLT ALAVTITLFI PVLLIRLLIR IPCGQPVFSR RTVCGMAGQP ITIRTFSDLK
GRMADLPLFL ELFTGRLALA GTAIREWDAP DPNAEQAYIS MVKPGIISLW DIRRTSKIAH
EGREAIEWEY IFSKRPAYDL LLLLRALPAM LYSETTSTYD PVFRLLGLDI DNITMAEAVS
LIQTDLRDNR QQAIYFVNPD CLNKMAGDRE YCEVLKDGDS IFPDGIGLTI AGKLLQSPLK
ENINGTDMLP YLCRMAAAER HSIYLLGGKP GIADKAASKI NREFGVTIAG TADGYFNHET
ETGRIIDDIN RSGASILLVA FGAPLQEKWI HRHRNRLQPA LLMGVGGLFD FYSGNVRRAP
RWMREIGIEW IYRIMQEPGR MWRRYVIGNP LFLYRVMKWK LLTGSGNH
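
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. A minimal Python sketch of that translation follows. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons; handling of alternative start codons is omitted here, which is an assumption that works for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between groups is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 27 bases of the gene sequence in this record.
print(translate("ATGTATCACA CGGCAGGCAT TGTGCTCGAA"))  # MYHTAGIVLE
print(translate("CTCCTAACCG GCAGCGGCAA CCACTGA"))     # LLTGSGNH*
```

The two fragments reproduce the first ten and last eight residues of the listed protein sequence, with the trailing `*` marking the TGA stop codon.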