Gene Gdia_0156 details

Gene Information

Locus tag: Gdia_0156
Symbol:
ID: 6973548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 170440
End bp: 172008
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 71%
IMG OID: 643389690
Product: anthranilate synthase component I
Protein accession: YP_002274571
Protein GI: 209542342
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
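
The start, end, and length fields above are mutually consistent if the coordinates are read as 1-based and inclusive. The following is a minimal sketch of that check, assuming inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 170440
end_bp = 172008

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1569 522
```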


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.970752
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.0258031
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCATCC CCTCCGCCTC CGCCGCCCCG GTTCCGGCCG GGCGCGACGA CGTGCTGGCC 
ACGCTCCGGC AGGGACAGGG CGCGGTGGTC TGGAGCATCG AGGCCGCGGA CCTGCTGACC
CCGGTCGCCG CCTATATGCG CCTGTCGCGC CTGGCCGGGG CCAGCGACAC GGCGCCCCCG
CGCAACGCGT TCCTGCTGGA AAGCGTCGAG GGCGGGGTGG CGCGCGGCCG GTATTCGGTG
ATCGGCCTGC TGCCCGACCT GATCTGGCGC TGCCATGGCG GCGCGGCCAC GATCAACACC
GACGCGGCGC GGGACCCGGC CGCGTTCGTG CCGGCCGGGG TGCCGCCGCT GGATTCGCTG
CGCGCCGTGA TCCGCGCCAG CCAGATGACG TTGCCGTCCG GCCTGCCGCC CATGGTGGCC
GGGCTGTTCG GCTATCTGGG CTATGACATG GTCCGGCAGA TGGAGCATCT GCCGGACATG
CCGGCTGACG ACCTGGACCT GCCCGAAGGG GTGATGATCC GCCCCGGGCT GTTCGCGATC
TTCGATACGG TGCGCGACGA ACTGATCCTG GCGGCGCCCG TGCGCCCCCG AAGCGACCGC
ACGCCCGAAG CGGCATGGCA GGCGGCGCAG GACCTGCTGG CCACGGCGCG GCGCACCCTG
TCCGAGCCGC TGCAACTGCA CGAGATCACG CCGGATTATA CCGGGCCGCT CGAGGCGCCG
CGCTCGACCT TCACGCGTGA GGGTTTCTGC GCCATGGTCC GGCGCATTCA GGACTACATC
GCGGCGGGCG ATGCCTTCCA GGTCGTGCCC AGCCAGCGTT TCTCGACCGC CTTCACGCTG
CCGCCGCTGG CGCTGTACCG GGCGCTGCGC CGCATCAATC CGGCGCCGTT CCTGTTCAAC
CTGGCGTTCG ACGGATTCAG CCTGGTGGGC TCGTCGCCCG AAATCCTGGT CCGGCTGCGC
GACGGGCAGA TGACGGTGCG CCCGCTGGCC GGCACCCGCC CGCGCGGCCG GACGGACGAG
GAGGATCTGG CGCTGGAGCG GGACCTGCTG GCCGACCCCA AGGAACTGGC CGAGCACCTG
ATGCTGATCG ATCTGGGGCG CAACGATATC GGGCGGGCCT GTACCGTGGG TTCGGTCCAG
GTGACCGAGA AATTCGTCAT CGAGCGCTTC AGCCACGTCA TGCACATTTC CTCGAACGTC
GAGGGGCAGT TGCGGCCGGG GCTGGAGGCG CTGGATGCCC TGATCGCGGG CTTTCCCGCC
GGGACCCTGA CCGGCGCGCC GAAGATCCGT GCGATGGAGA TCATCGACGA GGTCGAGCCG
ACCCGCCGCG CCACCTACGC CGGATGCATC GGCTATTTCG GGGCGAACGG CGCCATGGAT
ACCTGCATCG GCCTGCGCAT GGCCGTGGTC AAGGACGGGC AGATGCACGT GCAGGCCGGC
TGCGGCGTGG TGGCCGACAG CGTGCCCGAC CTGGAATACG AGGAAACCCG GCACAAGGCG
CGTGCCCTGT TCCGCGCGGC CGAGGACGCT GTGCAGTTCG CCCGCGGGCA GAACACGGCG
GGATCATAA
 
Protein sequence
MSIPSASAAP VPAGRDDVLA TLRQGQGAVV WSIEAADLLT PVAAYMRLSR LAGASDTAPP 
RNAFLLESVE GGVARGRYSV IGLLPDLIWR CHGGAATINT DAARDPAAFV PAGVPPLDSL
RAVIRASQMT LPSGLPPMVA GLFGYLGYDM VRQMEHLPDM PADDLDLPEG VMIRPGLFAI
FDTVRDELIL AAPVRPRSDR TPEAAWQAAQ DLLATARRTL SEPLQLHEIT PDYTGPLEAP
RSTFTREGFC AMVRRIQDYI AAGDAFQVVP SQRFSTAFTL PPLALYRALR RINPAPFLFN
LAFDGFSLVG SSPEILVRLR DGQMTVRPLA GTRPRGRTDE EDLALERDLL ADPKELAEHL
MLIDLGRNDI GRACTVGSVQ VTEKFVIERF SHVMHISSNV EGQLRPGLEA LDALIAGFPA
GTLTGAPKIR AMEIIDEVEP TRRATYAGCI GYFGANGAMD TCIGLRMAVV KDGQMHVQAG
CGVVADSVPD LEYEETRHKA RALFRAAEDA VQFARGQNTA GS
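
The protein sequence above corresponds to the gene sequence read codon by codon under translation table 11. Note that the gene begins with GTG, which normally encodes valine but is read as methionine when it serves as the initiation codon. The sketch below is a simplified illustration, not the database's own pipeline. It assumes the standard codon-to-amino-acid assignments (which table 11 shares with the standard code) and treats only ATG, GTG, and TTG as start codons, a deliberately reduced set. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of bacterial initiation codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna_text: str) -> str:
    """Translate a whitespace-formatted coding sequence up to the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon at position 0 is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break  # stop codon: end of the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("GTGAGCATCC CCTCCGCCTC CGCCGCCCCG"))  # MSIPSASAAP
```

The last nine bases of the gene, GGATCATAA, translate to GS followed by the TAA stop codon, which matches the end of the protein sequence.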