Gene Lcho_0301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_0301 
Symbol 
ID6161498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp317981 
End bp319852 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content71% 
IMG OID641663045 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_001789341 
Protein GI171056992 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTGGC AATCGCTCGA CAGCATCCTC ACCCGCATCC GCCCGCGCCG CGAAGTGCTC 
GCGCTGGCGA TCGACGCGCT GGTGGTGGCG GCCTGCTGGC ACATCACCTA CCTGTTCCGG
CTCGGCTTCG AGCGCTGGCA CAGCGCCCGG CCGGATTACG ACGTCTGGGT CATGCTGGCG
CTGGTGGCGC TGTATCTGGG CGTGTTCGTC GCCTTGCGCG TGCCCAAGGG CATGTGGCGC
TTCTCGGGTT TCGGCGAGGT GCAGCGGCTC ACGTTGGCGT GTGCGATCGC CGGGCTGGTC
GGCGCGGTGG CCGTGCTGAT GGCGCAGCTG TCGCAGGTGC CGCGCGCGGT GCTGGCGCTG
CACCCGGTCG TCAGCCTGAT GGGGCTGGCG ATGGTGCGCA TCGGCTACCG CATGTTGTAC
GAACACATGC GTGGGCGCAT CTCCGGCAGC GCCACCGAAA CCCGCCGCGC GCTGGTGATG
GGCGCGGGCG ACGCGGCGCG GCTCTTGATC GCCGGCATCC AGCACCACGG CTGGGTGGTG
GTCGGCCTGC TCGACGACGA TCGGCGGCGC CTGGGCACAC GCGTCAGCAA CGTGCCGGTG
CTCGGGCCGC TGGACAGTGC GCCGCGCTGG GCCGAGCTGC ACGGCATCAG CCACATCATC
GTCGCGCTGC CGTCGGCCAC GCCGGCCGAA CGCCGCCGCG CGCTCGACCT GGCCGCCGCC
ACCCATCTGC CGGTGGTGAC GGTGCCCAGC GCCGCCGAGC TGCGCGAGGG AACCACGGTG
ACGCGGGTGC GCGAGATCGA GGCCGAAGAC CTGCTCGGCC GCGAGCCGGT GCAGCTCGAC
GAAGGCGGCA TCAGCGAGGC GCTGGGTGGC AAGGTGGTGC TGATCACCGG CGCGGGCGGT
TCGATCGGCT CGGAGCTGTG CCGCCAGGTG GCGCGTTACG GCCCGCTCAA GCTGGTGCTC
TACGAGCTGA GCGAGTTCGC GCTCTACCGC ATCGAGCAGG AGCTGAGCGA GCACTTCCCG
CATATCCCGC TGGTGCGGCT GGTGGGCGAC GTGCGCGACC CGGAGCACCT GCGCGCCACC
TTCACACGCG TGCGCCCGCA GGTGGTGTTC CACGCCGCGG CCTACAAGCA CGTGCCGCTG
ATGGAGGAGG ACAACGCCTT CGCCGCCTTG CGCAACAACA CGCTCGGCAC CTGGCGCGCA
GCCAGCGCGG CGGCCGAGGC GGGCGCCGAA CGTTTCGTGC TGATCTCGAC CGACAAGGCC
GTCAACCCGA CCAACGTGAT GGGCGCGAGC AAACGCGCGG CCGAGATGGT GATCGCCAAG
CTCGCGGCCG AGGTGCTGGC GCGCGGCGGG CGCACGCGTT TCATGGCGGT GCGTTTTGGC
AATGTGCTGG GTTCGTCGGG CAGCGTGATC CCGAAGTTCA AGGAGCAGAT CGCCCGCGGC
GGGCCGGTGA CGGTGACACA CCCCGACATC ACGCGCTTCT TCATGACCAT CCCCGAGGCT
GCGCGACTGG TGGTGCAGGC CGCGGCGATC GGCGAGGGCG GTCAGGTGTT CGTGCTCGAC
ATGGGCGAGC CGGTGCGCAT CGTCGACCTG GCGCGCGACC TGATCCGCAT GAGCGGCCAT
TCGGCCGACG AGATCCCGAT CACCTTCAGC GGCCTGCGCC CGGGCGAAAA GCTCTACGAA
GAACTGCTGG CCGACGCCGA CGCGACGCTT GCGACGCGCT TCGAGCGCCT GCGCATCGCC
CGCCTCGACG ACCGCGGCCA CGACGTGCAG GCATTGCTCG ACTGGGCCGC CGAGCGCAGC
AGCGCGCCCG ACGACGAAGT GCGCGAACGG CTGGCGCGGC TGGTGTCGGA ATACCGCCGC
GCCGGGCATT GA
 
Protein sequence
MIWQSLDSIL TRIRPRREVL ALAIDALVVA ACWHITYLFR LGFERWHSAR PDYDVWVMLA 
LVALYLGVFV ALRVPKGMWR FSGFGEVQRL TLACAIAGLV GAVAVLMAQL SQVPRAVLAL
HPVVSLMGLA MVRIGYRMLY EHMRGRISGS ATETRRALVM GAGDAARLLI AGIQHHGWVV
VGLLDDDRRR LGTRVSNVPV LGPLDSAPRW AELHGISHII VALPSATPAE RRRALDLAAA
THLPVVTVPS AAELREGTTV TRVREIEAED LLGREPVQLD EGGISEALGG KVVLITGAGG
SIGSELCRQV ARYGPLKLVL YELSEFALYR IEQELSEHFP HIPLVRLVGD VRDPEHLRAT
FTRVRPQVVF HAAAYKHVPL MEEDNAFAAL RNNTLGTWRA ASAAAEAGAE RFVLISTDKA
VNPTNVMGAS KRAAEMVIAK LAAEVLARGG RTRFMAVRFG NVLGSSGSVI PKFKEQIARG
GPVTVTHPDI TRFFMTIPEA ARLVVQAAAI GEGGQVFVLD MGEPVRIVDL ARDLIRMSGH
SADEIPITFS GLRPGEKLYE ELLADADATL ATRFERLRIA RLDDRGHDVQ ALLDWAAERS
SAPDDEVRER LARLVSEYRR AGH