Gene Lcho_0940 details


Gene Information

Locus tag: Lcho_0940
Symbol:
ID: 6160221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 1004178
End bp: 1005299
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 67%
IMG OID: 641663691
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_001789977
Protein GI: 171057628
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
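
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch in Python, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive (the record does not say so, but the listed gene length is consistent with it) and that the CDS ends in a single stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the record above.
length_bp = gene_length(1004178, 1005299)
print(length_bp, protein_length(length_bp))  # 1122 373
```

Both values agree with the Gene Length and Protein Length fields of the record.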


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 120
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCCA AGCACAGCCC CCACGCCGCC GATCATTGGC CTGCACCCGT GGACAAGACC 
TCGCAGACCG ATGACGAAAG GATTGTTGAC GTGGTGCCAT TGCCTCCCCC CGAACACCTG
ATCCGCTTCT TCCCGATCAG CGGAACGCCG GTCGAGACCC TGATCGGCCA GACCCGCCAC
ACCATCCGCG AGATCCTGCA TGGCCGCGAC GACCGCCTGC TGGTGATCAT CGGCCCGTGC
TCGATCCACG ACCCCGCCGC CGCGCTCGAA TACGCCCGCC GCCTGCTGCC GTTGCGCCAG
AAATACGCCG GCACGCTGGA GGTGGTGATG CGCGTGTACT TCGAGAAACC GCGCACCACG
GTCGGCTGGA AGGGCCTGAT CAACGACCCG TACCTCGATG AGAGCTACCG CATCGACGAG
GGCCTGCGCA TCGCGCGTCA GCTGCTGCTC GACATCAACC GCCTGGGCAT GCCCGCCGGC
AGCGAGTTCC TCGACACCAT CAGCCCGCAG TACATCGGCG ATCTGATCGC CTGGGGCGCG
ATCGGCGCGC GCACCACCGA GAGCCAGGTG CACCGCGAAC TGGCCTCGGG CCTGTCGGCG
CCGATCGGCT TCAAGAACGG CACCGACGGC AACATCAAGA TCGCCACCGA TGCGATCCAG
GCTGCCGCCG GCGCGCACCA TTTCCTGTCG GTGCACAAGA ACGGCCAGGT GTCGATCGTC
GAGACCCGCG GCAACAAGGA TTGCCACGTC ATCCTGCGCG GTGGCAAGGC GCCCAACTAC
GACGCCGAGA GTGTCGCCGC CGCCTGCAAG GACCTGGCGG CGGCCAAGCT CGAGCAGCGT
CTGATGGTCG ACTGCAGCCA CGCCAACAGC AGCAAGCAGC ACCAGCGCCA GATCGACGTG
GCCCGCGACA TCGCCGCGCA GATGGCCGGC GGCAGCCGCT CGATCTTCGG CGTGATGGTC
GAGAGCCACC TGGTGGCCGG CGCGCAGAAG TTCAGCCCCG GCAAGGACGA TCCGCGCAAC
CTGGCCTTCG GCCAGAGCAT CACCGACGCC TGCATCGGCT GGGACGACTC GGAGCAGGTA
CTGGAAATCC TGCATCAGGC GGTTCAGGCG CGCCGCGGCT GA
 
Protein sequence
MNAKHSPHAA DHWPAPVDKT SQTDDERIVD VVPLPPPEHL IRFFPISGTP VETLIGQTRH 
TIREILHGRD DRLLVIIGPC SIHDPAAALE YARRLLPLRQ KYAGTLEVVM RVYFEKPRTT
VGWKGLINDP YLDESYRIDE GLRIARQLLL DINRLGMPAG SEFLDTISPQ YIGDLIAWGA
IGARTTESQV HRELASGLSA PIGFKNGTDG NIKIATDAIQ AAAGAHHFLS VHKNGQVSIV
ETRGNKDCHV ILRGGKAPNY DAESVAAACK DLAAAKLEQR LMVDCSHANS SKQHQRQIDV
ARDIAAQMAG GSRSIFGVMV ESHLVAGAQK FSPGKDDPRN LAFGQSITDA CIGWDDSEQV
LEILHQAVQA RRG
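
The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal Python sketch, not part of the original record, that strips the whitespace, translates the DNA, and computes a GC fraction. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs from the standard table only in the start codons it permits). The helper names are hypothetical, and only the first display line of the gene sequence is used, to keep the example short.

```python
import itertools

# Standard codon assignments, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First display line of the gene sequence above.
first_line = "ATGAACGCCA AGCACAGCCC CCACGCCGCC GATCATTGGC CTGCACCCGT GGACAAGACC"
dna = clean_sequence(first_line)
print(translate(dna))  # MNAKHSPHAADHWPAPVDKT
```

The printed residues match the first two blocks of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would give the value to compare with the GC content field.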