Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1543 |
Symbol | |
ID | 6974953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 1719692 |
End bp | 1721233 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643391074 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002275937 |
Protein GI | 209543708 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.397228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.410252 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTTTCC GCTCTCATAC GCGCCGCGAC GCGCTTCAGA TCCTGGCCGG GGGCGCTATC GCGGCCTGCG CGCCGCCGGG CGCGCGGGCG CAGGCGGTGG AAATCTCCCG GCGCGGGGGG CGCATCCGCG TCGCCGGCTT TTCCGGGTCC AGCGCCGATA CGCTCGACCC CGCGCGCGGG GCGCTGTCCA CGGATTATAT CCGTGGCGCG ATGTTCTATG ACGCGCTGAC GGAACTGGAC GAGGCGCTGC AGGTACAGCC CTCCCTGGCC ATTGCGATCG AAAGCGACGA CGCCATCCGC TGGACGATCC GGCTGCGCAA GGGGGTCCGC TTTCATGACG GCTCCCCGCT GACGGCGGAC GATGTGGTGT TTTCGCTGCT GCGGCATCTC GACCCGCGCG TGGGCTCGCA GCAGAAGGCG ATCGCCCGGC AGTTCGGCAG CATCCGCGCG CGTGCCGCCG ACGAGGTCGA GTTGACGCTG GTCGCGGCCA ATGTCGATCT TCCGGCCCTT CTGTCCCTGG CGCCGTTCTA CATCATCAAG AACGCCACGA CCGATTTTTC GCGCGCCAAC GGCACGGGCC CGTTCCTCTG CCAGGAATTC AGCCCCGGCA TCCGCTCGGT CGCCGTGCGC AATCCGGACT ATTGGCGCGA CGGACAGCCC CACCTGGACG AAATCGAATT TTTCACCATC GCCGATGACA TGGCGCGCCA TGATGCGCTG ATGTCCGGGG ACGTCGACCT GATCGGCGGC GTCAATCCGC GGCTGGCGCC CCTGCTGCGG CAGCGCGGCC TGCAGATCAT GGAAGCGCCG GGCGGGGCCT ATACCGATTT CATCATGCGG CTGGACCAGG CGCCTGGCAA TAACCCGGAT TTCGTGCGCG GAATGAAATA CCTGTTCAAT CGCGAGCAGA TGAAAAGCGC GATCTTCCGG GGCTACGCGC AGGTCGGGAA CGACCAGCCG ATCGCACCGG GGTTTCCCTA TTTCGATGCG TCGCTGCCCC AGACCGTGCA GGACCTGGAC CGCGCGCGCT ATCATTTCCG GCGATCCGGC CTGACGGGCA GCCGGGTGCC GATGGTCTGT TCGCCGGCGG CGGAGGGATC GGTCGAAATG GCGATCCTGT TGCAGCACGA TGCGCGGCCG CTGGGCATCG ACCTGGCCAT CCAGCAGGTC CCGGCGGACG GGTACTGGTC GAATTACTGG ATGCAGGCGC CGATATCCTT CGGCAATCTG AACCCCCGGC CCCGGGCCGA AATGGCGTTT TCGCTGTCCT ATGCCTCGGA CGCGCCGTGG AACGAATCCC GCTGGCGCAA TCCTGAATTC GACCGGCTGC TGCGCGCCGC GCGGGGCGAG CGCAATGAAA GCCTGCGGGC CGAGATGTTC GCGCAAATGC AGGTCCTGGT CCATGACGGC AGTGGGGTGT GCATTCCGCT GTTCCTGAGT GATATCGACG CGTTTTCACC CCGTCTGCGC GGCATGCGGC CCAGGAAGAC GGGGGAGTTC ATGGGCTTCG AATTCGCCCG CCATGTCTGG CTGGCGTCAT GA
|
Protein sequence | MAFRSHTRRD ALQILAGGAI AACAPPGARA QAVEISRRGG RIRVAGFSGS SADTLDPARG ALSTDYIRGA MFYDALTELD EALQVQPSLA IAIESDDAIR WTIRLRKGVR FHDGSPLTAD DVVFSLLRHL DPRVGSQQKA IARQFGSIRA RAADEVELTL VAANVDLPAL LSLAPFYIIK NATTDFSRAN GTGPFLCQEF SPGIRSVAVR NPDYWRDGQP HLDEIEFFTI ADDMARHDAL MSGDVDLIGG VNPRLAPLLR QRGLQIMEAP GGAYTDFIMR LDQAPGNNPD FVRGMKYLFN REQMKSAIFR GYAQVGNDQP IAPGFPYFDA SLPQTVQDLD RARYHFRRSG LTGSRVPMVC SPAAEGSVEM AILLQHDARP LGIDLAIQQV PADGYWSNYW MQAPISFGNL NPRPRAEMAF SLSYASDAPW NESRWRNPEF DRLLRAARGE RNESLRAEMF AQMQVLVHDG SGVCIPLFLS DIDAFSPRLR GMRPRKTGEF MGFEFARHVW LAS
|
| |