Gene Gdia_0158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0158 
SymboltrpD 
ID6973550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp172733 
End bp173863 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content69% 
IMG OID643389692 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_002274573 
Protein GI209542344 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.523516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0330567 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGGCG TGCCGTCCCT GTCTGCCGAC CAAGCGGGTG CGTTCCGGAC CATCCTGAAC 
CGCCTGGCGC GGGGCGAAAC ACTGACTGAA ACCGAGGCCG AGGACGCGTT CGGCCTGATC
ATGGACGGCG GCGTGCCCGA CACGCTGATC GCCGCCTTCC TGATGGCGTT GCGGGTCCGG
GGTGAGAAGC GCGCGGAACT GCTGGGCGCG GTGCGCGCCG TGCGGTCACG CATGCGGGCG
GTCGGCCCCG TTCCTCCGGG CACGATCGAT GTCTGCGGCA CGGGCGGCGA CGGCCTGGGC
ACACTGAATA TTTCGACCGC CGTCGCCTTC GTCCTGGCCG CGCTGGGTGT TCCGGTCGCC
AAGCATGGCA ACCGGGCCCT GTCGTCGCGT TCCGGCGCGA CCGACGTGCT GGGCGCGCTG
GGCGTGGATC TGTCGGACGA CCCGTCGGTG ATCGCCGCGC GGATCAATGA CGGAAACCTG
GCTTTCATGG CGGCGCCGGC GCACCATCCG GCCATGCGCC ATGCCGGACC GGTGCGGGCC
GCGCTGGGAA TCCGCACGCT GTTCAACCTG ATCGGCCCCT TGTGCAATCC GGCCGGCGTC
ACACATCAAC TGGTCGGTGT GTTCGATCCG GCATGGCTGC GCCCGGTGGT GGAGACGTTG
CAGCTTCTGG GGTCGGAGCG CGTGTGGGCC GTGCATGGCT ATTGCGAAGG CGCGACGGGC
GGCCGGGGCG TGGACGAACT GACGCTGGCC GGTCCTACCG CGATCGTGGC GTTGCAGAAC
GGACGGATTT ACGACCTGAC GTTGCGGCCC GAGGATGCGG GCCTGCGCCC CGCGCCGATC
ACGGCGATCG CGGGTGGCGG GGCGGAGGAA AATGCGGCGG CCCTTACGGC TTTGCTGGCG
GGCGCCCATG GAGCCTATCG CGATACCGTG CTGCTGAACG CGGCGGCCTG CCTGCATGTG
GCGGGACGCG GTGCGGCACT GGATGACGAT GGGAGATTGA GACCGGCGTC GCTACGGGCG
CTGGTGGCGG ATGCGGCCCG CGTGCTGGAT GACGGATCGG CCCTGGCCAT GCTGAATTCC
GCGCGCCGTC GCCACATGGA TACGCCGGAG GGGATTACAC AAAGCTTATG A
 
Protein sequence
MDGVPSLSAD QAGAFRTILN RLARGETLTE TEAEDAFGLI MDGGVPDTLI AAFLMALRVR 
GEKRAELLGA VRAVRSRMRA VGPVPPGTID VCGTGGDGLG TLNISTAVAF VLAALGVPVA
KHGNRALSSR SGATDVLGAL GVDLSDDPSV IAARINDGNL AFMAAPAHHP AMRHAGPVRA
ALGIRTLFNL IGPLCNPAGV THQLVGVFDP AWLRPVVETL QLLGSERVWA VHGYCEGATG
GRGVDELTLA GPTAIVALQN GRIYDLTLRP EDAGLRPAPI TAIAGGGAEE NAAALTALLA
GAHGAYRDTV LLNAAACLHV AGRGAALDDD GRLRPASLRA LVADAARVLD DGSALAMLNS
ARRRHMDTPE GITQSL