Gene Avin_46640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_46640 
SymboltrpE 
ID7763527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4733112 
End bp4734590 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content69% 
IMG OID643807508 
Productanthranilate synthase component I 
Protein accessionYP_002801744 
Protein GI226946671 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCGCG AAGAATTCCT GCGCCTGGCC GCCAAAGGCT ACAACCGCAT TCCGCTCGCC 
TGCGAAACCC TCGCCGACTT CGACACCCCG CTGTCGATCT ACCTGAAACT CTCCGACGCG
CCCAATTCCT ACCTGCTCGA ATCGGTGCAG GGCGGCGAGA AATGGGGCCG CTACTCGATC
ATCGGCCTGC CGGCGCGCAC CGTGCTGCGC ATCCATGGCC AGCGGGTGAC GGTGAGCGTG
GACGGCGAGG AGGTGGAGCG CCACGACTGC GAAGACCCGC TGGCCTTCGT CGAGCAGTTC
AAGGCGCGCT ACCGGGTGCC GGACCTGCCG GGACTGCCGC GCTTCAACGG CGGCCTGGTC
GGCTACTTCG GTTACGACAG CGTGCGCTAC GTGGAGAAGA AGCTCGCCCG TTGCCCGAAC
CCGGACCCCC TGGGCACCCC GGACATCCTG CTGATGGTTT CCGACGCCGT GGTGGTGTTC
GACAACCTGG CCGGCAAGAT GCACCTGATC GTCCTCGCCG ACCCGGCCGA GGCGGACGCC
TTCGAGCGGG GCCGGGCGCA CCTGCGCGCG CTATCGGAGG AACTGCGCCA GCCGCTGGCG
CCGCGCCAGG GCATCGATCT CGGCGCGCCG GCTGGCGCGG AGCCTGCGTT CCGCTCCAGC
TTCGGCCGCG AGGACTTCGA GCGGACGGTG GCGCGCATCA AGGACTACAT CCTCGCCGGC
GACTGCATGC AGGTGGTGAT CTCCCAGCGC ATGTCGATCC CCTTCGCCGC CGCGCCCATC
GACCTGTACC GGGCGCTGCG CTGCTTCAAC CCGACCCCCT ACATGTACTT CTTCGACTTT
GGCGACTTCC ACGTGGTCGG CAGCTCGCCG GAGGTGCTGG TGCGCGTCGA GGACGGTCTG
GTCACGGTGC GCCCGATCGC CGGCACCCGC CCGCGCGGCG CCAGCGAGGA GGCCGACCTG
GCGCTGGAGC GCGACCTGCT CTCCGACGCC AAGGAGCTGG CCGAGCACCT GATGCTGATC
GACCTCGGCC GCAACGACGT CGGCCGGGTC GCCGATACCG GCTCGGTGAA GCTCACCGAG
AAGATGGTCA TCGAGCGCTA TTCCAACGTC ATGCACATCG TCTCCAACGT CACCGGCCAT
CTGCGCCAGG GGCTGACGGC GATGGACGCG CTGCGCGCCA TCCTGCCGGC CGGCACCCTC
TCCGGCGCGC CCAAGGTCCG CGCCATGGAG ATCATCGACG AACTGGAGCC GGTCAAGCGC
GGCGTCTACG GCGGCGCCGT CGGCTACCTG GCGTGGAACG GCAACATGGA TACCGCGATC
GCCATCCGCA CCGCGGTGAT CAAGGACGGC GAACTGCACG TCCAGGCCGG CGCCGGCATC
GTCGCCGATT CGGTGCCGGC GCTGGAGTGG GAGGAAACCC TGAACAAGCG CCGCGCCATG
TTCCGCGCCG TGGCCCTGGC CGAGCAGGGC GGGACCTGA
 
Protein sequence
MIREEFLRLA AKGYNRIPLA CETLADFDTP LSIYLKLSDA PNSYLLESVQ GGEKWGRYSI 
IGLPARTVLR IHGQRVTVSV DGEEVERHDC EDPLAFVEQF KARYRVPDLP GLPRFNGGLV
GYFGYDSVRY VEKKLARCPN PDPLGTPDIL LMVSDAVVVF DNLAGKMHLI VLADPAEADA
FERGRAHLRA LSEELRQPLA PRQGIDLGAP AGAEPAFRSS FGREDFERTV ARIKDYILAG
DCMQVVISQR MSIPFAAAPI DLYRALRCFN PTPYMYFFDF GDFHVVGSSP EVLVRVEDGL
VTVRPIAGTR PRGASEEADL ALERDLLSDA KELAEHLMLI DLGRNDVGRV ADTGSVKLTE
KMVIERYSNV MHIVSNVTGH LRQGLTAMDA LRAILPAGTL SGAPKVRAME IIDELEPVKR
GVYGGAVGYL AWNGNMDTAI AIRTAVIKDG ELHVQAGAGI VADSVPALEW EETLNKRRAM
FRAVALAEQG GT