Gene Lcho_3400 details

Gene Information

Locus tag: Lcho_3400
Symbol:
ID: 6163217
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 3781713
End bp: 3782834
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 69%
IMG OID: 641666175
Product: 3-dehydroquinate synthase
Protein accession: YP_001792423
Protein GI: 171060074
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
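
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3781713
END_BP = 3782834


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1122 373
```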


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCTG CAACGCCAAC CTCCTTGCCC TTTGCCCTGG TCCCGGTCGC GCTGGCCGAC 
CGCAGTTACG ACATCCTGAT CGGCAGCGGG CTGATCGCCT GGCAGGCCAG CTGGGCCGGC
CTGCCGGCCG CGAGCACGGC GGTGATCGTG AGCAACACCA CCGTGGCGCC GCTCCATGCC
GCGCGGGTGC GCGAGGCGCT GCAGTCGCAT TACCGGCGCG TGCTGTGCGT CGAGTTGCCC
GACGGCGAGG TCCACAAGGA CTGGCAGACG CTCAACCTGA TCTTCGATCA CCTGCTGGCC
AACGCCTGCG ACCGCAAGAC CGTGCTGGTG GCGCTGGGCG GCGGTGTGAT CGGCGACATG
ACCGGCTTCG CCGCCGCCTG CTACATGCGC GGCGTGCCCT TCGTGCAGGT GCCGACCACG
CTGCTGGCGC AGGTCGATTC GTCGGTGGGC GGCAAGACCG CCGTCAACCA TCCGCTGGGC
AAGAACATGA TCGGCGCGTT CTACCAGCCG GTGCGCGTGA TCTGCGACCT CGACACGCTC
GACACCCTGC CGCCGCGCGA ACTCGCTGCC GGACTGGCCG AGGTGATCAA GTACGGGCCG
ATTGCGGACG GCGGTTTTCT CGACTGGATC GAGGTGAATC TCGATGCGCT GCTGGCCCGT
GACAAGGCTG CGCTCGCACA CGCCGTGCGG CGCTCCTGCG AGATCAAGGC CGAGGTGGTG
GGTGGCGACG AGCGCGAGAG TGGCCGGCGT GCCATCCTCA ATTTCGGCCA CACCTTCGGC
CACGCGATCG AGGCGGGCAT GGGGTATGGC GCCTGGCTGC ACGGCGAGGC GGTCGGTTGC
GGCATGGTGA TGGCGGCCGA TCTGTCGGCG CGGCTGGGCC TGATGCCCGC GGCGTTCGTG
TCGCGCCTGC GTCACATCTG CGAGCGGGCC GGCCTGCCGG TGCGGGCACC GCGGCTCGAT
GCCACGCACA ACGTCGAGCG TTACCTGGAG CTGATGCAGG TCGACAAGAA GGCCGAGGAC
GGCCAGATCC GCTTCGTCGT GATCGACGCC ATGGGCAGCG CACGCATGCA GGCGGCACCC
GAGGCACTGG TGCGCCAGGT CATCGAGGCG GCTTGCGCCT GA
 
Protein sequence
MNAATPTSLP FALVPVALAD RSYDILIGSG LIAWQASWAG LPAASTAVIV SNTTVAPLHA 
ARVREALQSH YRRVLCVELP DGEVHKDWQT LNLIFDHLLA NACDRKTVLV ALGGGVIGDM
TGFAAACYMR GVPFVQVPTT LLAQVDSSVG GKTAVNHPLG KNMIGAFYQP VRVICDLDTL
DTLPPRELAA GLAEVIKYGP IADGGFLDWI EVNLDALLAR DKAALAHAVR RSCEIKAEVV
GGDERESGRR AILNFGHTFG HAIEAGMGYG AWLHGEAVGC GMVMAADLSA RLGLMPAAFV
SRLRHICERA GLPVRAPRLD ATHNVERYLE LMQVDKKAED GQIRFVVIDA MGSARMQAAP
EALVRQVIEA ACA
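
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block could be cleaned, its GC percentage computed, and the coding sequence translated. It assumes the amino acid assignments of translation table 11 match the standard code (table 11 differs in its permitted start codons), and it does not convert an alternative start codon such as GTG or TTG to methionine. The helper names are hypothetical.

```python
# Compact standard codon table: first, second and third base each cycle
# through T, C, A, G. Table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
first_blocks = "ATGAATGCTG CAACGCCAAC CTCCTTGCCC"
print(translate(first_blocks))  # MNAATPTSLP
```

Passing the full gene sequence to translate() and gc_percent() in the same way allows the result to be compared with the protein sequence and the GC content listed in the record.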