Gene Information |
Locus tag | Gdia_0306 |
Symbol | |
ID | 6973698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 340136 |
End bp | 342070 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643389837 |
Product | hypothetical protein |
Protein accession | YP_002274718 |
Protein GI | 209542489 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
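The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 340136
end_bp = 342070
gene_length_bp = 1935
protein_length_aa = 644

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 1935 645 644
```

Under these assumptions the span matches the listed Gene Length and the codon count less one matches the listed Protein Length.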
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.919182 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.00578351 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCCACGA CCACGACCTA CAAGACCGCT TCCGGGGTGA CCTACACCGT CACCGACACC TCCTACCTGG GGCAGAACGA CTACGACGTC ACGATCACGG CCAGTGACGG CACGGTCCTG CTCGATCAGG ACAATATCAA TCGCGGCATC GACATCCTCG GCGTCATCGA CCTCGCGGCT TCCGGCGATG TCATTACGGG GTCCACCGAT TCCACCTCGC TGGTCAGCCT TGCCAGCATC GGAACCTATG TCAGCGTTCC CGGCGCGACG GGCAACTTCA TCGTCGGCGC CGGTGCCCTT GCCGCCAACA CATATTACAT CGGCGGAACG ACGACGATAT CCGGCCTGGC GAACCTGGTC ACCGGAACGA CCATCAATGT CGTAGGGGGC ACGGCAACGC TGTCCGGCAA CAGCGGCAGC ACCCTGCTCG GCGCCCTGAA CGGATCGACG GTCAACATCG AATATGGCGG CACGTTCAAC ACCGGCGCCG CCCTCGGCAG CCTCCTGGAA GGGGCGACGG TTTCGTTCGG GAGCGGCGGC GGCACGCTGG TCATCAATGG CGGCGGCACC GCCATCAGCC TGCTGGCATC CGGTCCGCTG TCGGCCACGA CCATCCAGAA CTACGATCCG TCCAGGGACA CGATCGAACT TCAGGACACC GTCGCTCCGA TCTCGGGCTA CACGATATCC GGCGATACGA CCCGGACGAT CACCCTGTAC GGAAGCGACG GCACGCAGGT CGCCACCTAC ACGGTCAACC TGGCATCCGG CGTAAACCTG GCCAACGGAA CATACAATGC CGTCAACAGC ACGCAGGGCA ATCCCCTGAA CATCACCTAC ACCACCGGCA ATACCTATAT CGGCGTCTGC TTCCTGGCGG ATTCGATGAT CCGCACACCG TCCGGCGACA TCGCGGTCCA GGACATCCGC GTGGGCGACG AAATCCTGGC CTGCCCGGAC GGTGAGTGGC GGGACGGCGA ACAGGGGACC GGCGAACGGC TCGGCACGGT CGTCTGGACC GGCAAGGCCC ATGCCACCGT TCGTCCCGGC CTTCCGGATG ACGAGGCCGG CTATCCGGTC CGCATCCTCA GGGACGCCAT CGCCGACGGC GTGCCCTACA AGGACATGCT GGTCACGCCG GAACATTGCC TGTTCCTCGA CGGCGTCTTC ATTCCGGCCC GCATGCTGGT CAACGGCCGG TCCATCTTCC ACGACCGCAC CATTACCGCC TACGATTATT ACCACATCGA AACGGAACGG CATTCCGTCA TTATCGCGGA CGGCATGCCG ACCGAAAGCT ATCTCGATAC CGGAAACCGC CGCTCTTTCC GTCAGGATGG GAAAATCGTC CATATCGGCG CGGGCAACGC CCGATCCTGG ACGGAAGACG CCGCCGCCCC GCTGGGGGTA ACGCGCGCGG TGGTCGAACC CGTCTTCCGC CGGATCGAGG CCCGGGCGCG GGACGCGGGC ATCGCCAGTG CGATCGTGGG GCCGGTCCTG ACCGACGATA CCGGCCTTCA CCTGCTGACC GAGAACGGTC AGGCCATCCG CCGGACCCGC GATGCCAACG GGTATGCCAC GTTCCTGATC CCGCCAGATG TCGGCACCGT CCGCATCGTC TCGCGCACCA GCCGGCCCAG CGATGCGATC GGCCCCTTCC TCGACGACCG GCGACGCCTC GGCGTGCTGG TCGGCGGAAT CACCCTCCTC GATTCGGACA ATATGCGTCT GATCGACACC TATCTGGCCG ATCCGGCACT CGACGGATGG GATGTCCAGG AAGCCGGTCC GCACCGCTGG 
ACCAATGGCG ATGCCATACT GCCGCTGGGC CCGCGCCGCC CCGACAGCTT CGGCATCCTG GCCATCCAGG TACTGGCGGG CGGCCCCTAT CCCGTTACGG CGGAGGCAAC GGAAGCAGTC GCCCAATCAG CGTGA
|
Protein sequence | MSTTTTYKTA SGVTYTVTDT SYLGQNDYDV TITASDGTVL LDQDNINRGI DILGVIDLAA SGDVITGSTD STSLVSLASI GTYVSVPGAT GNFIVGAGAL AANTYYIGGT TTISGLANLV TGTTINVVGG TATLSGNSGS TLLGALNGST VNIEYGGTFN TGAALGSLLE GATVSFGSGG GTLVINGGGT AISLLASGPL SATTIQNYDP SRDTIELQDT VAPISGYTIS GDTTRTITLY GSDGTQVATY TVNLASGVNL ANGTYNAVNS TQGNPLNITY TTGNTYIGVC FLADSMIRTP SGDIAVQDIR VGDEILACPD GEWRDGEQGT GERLGTVVWT GKAHATVRPG LPDDEAGYPV RILRDAIADG VPYKDMLVTP EHCLFLDGVF IPARMLVNGR SIFHDRTITA YDYYHIETER HSVIIADGMP TESYLDTGNR RSFRQDGKIV HIGAGNARSW TEDAAAPLGV TRAVVEPVFR RIEARARDAG IASAIVGPVL TDDTGLHLLT ENGQAIRRTR DANGYATFLI PPDVGTVRIV SRTSRPSDAI GPFLDDRRRL GVLVGGITLL DSDNMRLIDT YLADPALDGW DVQEAGPHRW TNGDAILPLG PRRPDSFGIL AIQVLAGGPY PVTAEATEAV AQSA
|
| |
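The Gene sequence and Protein sequence fields can be related through translation table 11, as listed under Gene Information. The following is a minimal sketch rather than the database's own method: it builds the codon table from the compact amino acid string that NCBI tables 1 and 11 share, and translates only the first three and the last two 10-base blocks of the gene sequence. The function name `translate` and the variable names are hypothetical, and the handling of alternative start codons in table 11 is not modelled (the gene starts with ATG, so it is not needed here).

```python
BASES = "TCAG"
# Amino acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order for the first, second and third codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence, copied from the record.
first_blocks = "ATGTCCACGA CCACGACCTA CAAGACCGCT"
last_blocks = "GCCCAATCAG CGTGA"

print(translate(first_blocks))  # MSTTTTYKTA
print(translate(last_blocks))   # AQSA
```

Both fragments agree with the start (MSTTTTYKTA) and the end (AQSA) of the listed protein sequence, and the final codon TGA is a stop codon.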