Gene Gdia_1898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1898 
Symbol 
ID6975321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2112954 
End bp2114348 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content74% 
IMG OID643391424 
ProductChorismate binding-like protein 
Protein accessionYP_002276273 
Protein GI209544044 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0230156 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGGTC CGGTTGGCAT CGGGGACCTT CACGCCGTCG AACTGCCGTG GCGGGACCCT 
GATGACGTGC TGTGGGCGTG GCGGGACGAA CCGTGGCTGG CCTGCCTGGA CAGCGGCGGG
CCGGCGGGGC CGCGCGCGCG CTGGACCATC CTCTGCCGCC GGCCGCGCCA GGTCCTTGAA
TGGCGGGACG GCGGGGCCCC GCTGGCATCT GATCCGCTGG CGGCCCTGCG CGCCCTGCTG
CCGCCGGCCG GATCGCCGCC CGTGACCGCA TCCGGCGAGG CCCTGCCGTT CGCGGGCGGG
GTGATCGGCT TCGCCGGGTA CGGGGTGGGG CGGCGCCTCG AAGGGATCGC GACCCGCCAC
GTGGCCGATC CGGCCCAGCC GGACCTGGCG GCGGGGTTCT ATGACCATGC CGTCCTGTTC
GACCGTGTGC GGCGGCGCGC CTACCTGGCG ACGATCCGCC CCGATCCGGA CCGGCTGATC
GCCGACGTAA GCGCCGCGTG GGCGGCCATG GAGCCGGCCC CGGCCCCCGC CGCCCTGCCG
CGCCTGGCCT TCGCCGCCGA CCAGGCGCCG GGGCAGTACG CCGCCGCGGT GGCGCGCGCG
GTGGAGCGGA TCGCGGCGGG CGACATCTTC CAGGTCAACA TCACCGGGCG CATGGCCGCC
CGCCGCCCGC CCGGCCTGAC GGACGGCGCC ATCTACCGGG CCCTGCGCCG GGCCTCGCCC
GCGCCGTTCG GGGCGTGGCT GGCCTGCGGG CCGGGTTTCG GCCTGCTGTC GGCCTCGCCG
GAGCGGTTCG TGCATCTCGG CCCTGACGGG GTGGCCCGCA CCCGGCCGAT CAAGGGCACC
CGCCCGCGCG GCGCCACCCC GGCACAGGAT GCCGCCCGCC GTGTCGAACT CGCGGCCGAT
GAGAAGGAAC GGGCGGAAAA CCTGATGATC GTGGACCTGA TGCGCAACGA TCTGGGTCGT
GTCGCGCGGA TCGGCAGCGT CGGTGTCCCG GAACTGCTGT CGGTGGAGAC CTTCACCCAT
GTCCATCACC TGGTGTCCGA GGTCACGGCC ACCCTGGCCC CGGGACGGGA TGCCATCGAC
CTGCTGCGCG CCACCCTGCC GCCCGGATCG GTGACCGGCG CGCCGAAGCA CCGCGCGATG
CAGATCATCG ACGAACTGGA ATCGTCGGCG CGGCAGGCCT ATTGCGGGGT GGTCTTTCGC
ATCGGCACCG ATGGCGCGAT GGACAGTTCG GTCGTCATCC GCGCCCTGGC CACGACGCCG
GATGCGATCG TGGCGGCGGC GGGGGGCGGG ATCACGATCC TGTCGGACCC CGGGCGGGAA
TATGCGGAAA TGTGCCTGAA GATCGCGCCG CTGCTGGCCC TGTTCGGGGC CGAGCCGGCC
GGGATGGCGT CATGA
 
Protein sequence
MDGPVGIGDL HAVELPWRDP DDVLWAWRDE PWLACLDSGG PAGPRARWTI LCRRPRQVLE 
WRDGGAPLAS DPLAALRALL PPAGSPPVTA SGEALPFAGG VIGFAGYGVG RRLEGIATRH
VADPAQPDLA AGFYDHAVLF DRVRRRAYLA TIRPDPDRLI ADVSAAWAAM EPAPAPAALP
RLAFAADQAP GQYAAAVARA VERIAAGDIF QVNITGRMAA RRPPGLTDGA IYRALRRASP
APFGAWLACG PGFGLLSASP ERFVHLGPDG VARTRPIKGT RPRGATPAQD AARRVELAAD
EKERAENLMI VDLMRNDLGR VARIGSVGVP ELLSVETFTH VHHLVSEVTA TLAPGRDAID
LLRATLPPGS VTGAPKHRAM QIIDELESSA RQAYCGVVFR IGTDGAMDSS VVIRALATTP
DAIVAAAGGG ITILSDPGRE YAEMCLKIAP LLALFGAEPA GMAS