Gene Lcho_3897 details

Gene Information

Locus tag: Lcho_3897
Symbol:
ID: 6161928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 4377610
End bp: 4379097
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 67%
IMG OID: 641666675
Product: anthranilate synthase component I
Protein accession: YP_001792916
Protein GI: 171060567
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
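
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either assumption, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4377610
end_bp = 4379097

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon, which encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1488 bp
print(protein_length, "aa")  # 495 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.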


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.839724
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCACCG AACTCGAATT CAAGAGCCTG GCCGCCGAAG GCTACAACCG CATCCCGCTG 
GTCAGCGAGG CGTTTGCGGA TCTCGAAACC CCGCTGTCGC TCTACCTCAA GCTCGCCGGC
GGCAAGCCGC GCAGCTTCCT GCTGGAGTCG GTGGTGGGCG GCGAGCGTTT CGGGCGCTAC
TCGTTCATCG GCCTGCCCGC GCGCACGCTG CTGCGTTGCA CGCTGGGCGT GACCGAGGTC
GTCACCGACG GCCAGGTGGT CGAGCGCCAC GAGGGCAATC CGCTCGATTT CATCGAGGCC
TACCAGAATC GCTTCAAGGT CGCGCTGCGG CCCGGCCTGC CGCGTTTCTG CGGCGGGCTG
GCCGGCTACT TCGGCTACGA CACGGTGCGC TACATCGAGA AGAAGCTCGC CGACACGCGC
CTGCCCGGCG GCATCGGCAC GCCCGAGATC CTGCTGCTGC AGACCGAGGA GCTGGCGGTG
ATCGACAACC TGTCGGGGCG GCTGTACCTG ATCGTCTACG CCGACCCGCG CCAGAGCGAG
GCCTATTTCA GCGCGCAGAA ACGCCTGTCC GAGCTGAGCG ACCAGCTGCG CTACAGCGTC
ACCGCGCCGC AGGTCAAGCG CGGGCCGTCG TACCCGGTCG AGCGCGAATA TCCGAAGGCC
GAGTTCGAGG CCGCGGTGCT CAAGGCCAAG GAGTACATCG CCGCCGGCGA CTGCATGCAG
ATCGTGATCG GCCAGCGCCT CAAGAAGCGC TACACCGAGA ACCCGCTGTC GCTGTACCGC
GCGCTGCGCT CGCTCAATCC GAGCCCGTAC ATGTACTACT ACGACATGGG CGAGTTCCAG
ATCGTCGGCT CGTCGCCCGA GATCCTGGTG CGCCAGGAGG CGCGCAGCGC CAACGGCAAG
ACCGAGAAGA TCGTGACCAT CCGCCCGATC GCCGGCACCC GCCCGCGCGG CGCCACGCCC
GAGATGGACG CGCAGAACGA GGTCGACCTG CTGGCCGACC CGAAGGAGCG CGCCGAGCAC
CTGATGCTGA TCGACCTGGC GCGCAACGAC ATCGGCCGCA TCGCCAACAC CGGCACCGTC
AAGGTGACCG AGGCCTTCGG CATCGAACGT TATTCGCACG TGATGCACAT CGTCAGCAAC
GTCGAGGGTG TGCTCAAAGA CGGCATGAGC AGCCTCGACG TGCTGCGCGC CAGCTTCCCC
GCCGGCACCT TGTCAGGGGC GCCGAAGATC CACGCGATGG AGATCATCGA CCAGCTCGAG
ATCAGCGAGC GCGGCATCTA CGGCGGCGCG GTCGGCTACC TGAGTTTTGC CGGCGACATG
GACGTGGCGA TCGCCATCCG CACCGGCATC GTCAAGGACC AGACGCTCTA CGTGCAGGCC
GGCGCCGGCG TGGTCGCCGA CTCGGTGCCC GAGATGGAGT GGAAGGAAAC CGAAGTGAAG
ATGCGCGCCG TGATCCGCGC GGCTGAACTG GTCGAGGAAG GTTTCTGA
 
Protein sequence
MITELEFKSL AAEGYNRIPL VSEAFADLET PLSLYLKLAG GKPRSFLLES VVGGERFGRY 
SFIGLPARTL LRCTLGVTEV VTDGQVVERH EGNPLDFIEA YQNRFKVALR PGLPRFCGGL
AGYFGYDTVR YIEKKLADTR LPGGIGTPEI LLLQTEELAV IDNLSGRLYL IVYADPRQSE
AYFSAQKRLS ELSDQLRYSV TAPQVKRGPS YPVEREYPKA EFEAAVLKAK EYIAAGDCMQ
IVIGQRLKKR YTENPLSLYR ALRSLNPSPY MYYYDMGEFQ IVGSSPEILV RQEARSANGK
TEKIVTIRPI AGTRPRGATP EMDAQNEVDL LADPKERAEH LMLIDLARND IGRIANTGTV
KVTEAFGIER YSHVMHIVSN VEGVLKDGMS SLDVLRASFP AGTLSGAPKI HAMEIIDQLE
ISERGIYGGA VGYLSFAGDM DVAIAIRTGI VKDQTLYVQA GAGVVADSVP EMEWKETEVK
MRAVIRAAEL VEEGF
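
The relationship between the two sequences can be illustrated with a minimal translation sketch. It assumes NCBI translation table 11 (as listed in the record), whose codon assignments match the standard code but which accepts alternative start codons such as GTG. The function `translate_cds` and its `is_cds_start` parameter are hypothetical names introduced here and are not part of any database tool. Only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons accepted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna, is_cds_start=True):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon.

    If is_cds_start is True and the first codon is a table 11 start codon,
    it is translated as the initiator methionine (M).
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final blocks of the gene sequence above.
first_block = "GTGATCACCG AACTCGAATT CAAGAGCCTG"
last_block = "GTCGAGGAAG GTTTCTGA"

print(translate_cds(first_block))                      # MITELEFKSL
print(translate_cds(last_block, is_cds_start=False))   # VEEGF
```

The outputs match the first ten and last five residues of the protein sequence. The GTG start codon is read as methionine, and the terminal TGA is a stop codon.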