Gene Information |
Locus tag | Gdia_0543 |
Symbol | |
ID | 6973940 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 600274 |
End bp | 601515 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643390076 |
Product | integrase family protein |
Protein accession | YP_002274952 |
Protein GI | 209542723 |
COG category | |
COG ID | |
TIGRFAM ID | |
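The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical, chosen for illustration only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 600274
END_BP = 601515
GENE_LENGTH_BP = 1242
PROTEIN_LENGTH_AA = 413


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus the terminal stop codon.

    Assumes the CDS is complete and ends with exactly one stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, the listed gene length (1242 bp) and protein length (413 aa) agree with the start and end coordinates.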
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000277419 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.0534863 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAGC ATAGTTCTAG CAGCGTGGCG GGAGCGTTTA ATTCGCAGCC ATGGTCCATC TACAAGCAGA CAAACTCAAA AAACTGGTTC GTCCGCTTCT CGATCAAGGG ACAGGGGCAG ATTCGAAAGT CCCTTCAGAC CGCAGATGAG AAGGAAGCAG ATCGTAAGGC TGCACGGATC TATTATGAGG CTCTCTTACG TGCAGAGCAG GGCTTGGAAG CCAGAGACAA GACTGTAAGG GTCCTGGCAG ATGAGTGGAT CGCAGAGGGC GTCCTGAAGG CTCCTGAACG GACTGCTCTC TCCCGATATA TCGTGGGATA CTTCGGGGAC GATAAGCCAT CAGACATCAC CTACAGGGGG CTTCAGGCGT ATCGGAAATG GCGCTTAAGC TATTGGACAG AAGGGCCTGG TAAGAATCTG GAGATGGTCG TCTATCAGCG ACTCGGGCGA CAGATTCGTA GGCCGGTGAC GAGAAAGACC CCTACCCGCT CCACACTCAA CAGTGAGCAG GTCGTCTTTA AAAAATTCTT AACCCGATGT CAGAATCTCG GGCACATAAA AACAATTCCC AAGTTTGATA AGATTGAAGG TGAAGTTCAC AATCGGCCTG GGTTTTCTCA TGCCGAGATT GAAAAAATCT TGTCAGTCCT GAGAAAGAGG ACCGTCCTCC CCGCTCTATC CAATGAGGAA CGTTATCCAC ATATTCTCCT CTATGGATAT GTCGGAATTA TGTGTGGTTC GGGGATGCGC CCCATTGAGT GTCAAAAACT CAGATGGATC GATCTAGTTG GTTTCGATGA ATCTCGAAAC GCTAAACTTT GTGAAGGTCG AATTACCGTC CGGGTTCATG GCAAAGGTAA ATCAAGAGAG TTTGTACCTC TTGATGGGAC AATCTCTGAT TTTCTAATGA TATGGGATGT TCAGAAAATA ATACGGGGAT CTGATCCAGA TCCCAATGAT TATATCTTTG TTGACATCAA GGGTAGGCAT ATCCAGACCT TTAACCCAGA AGTAGTATCA CTTCTGGAAG AATGCTCTCT GCGACAGGAT TACCGAGGAA TAAAGAGAAC ATCTTATTCG TTCAGACACT ATTATATTAC GTTCATGATT AACGCACATG CGAACATATA CGATATCGCC AAAAACTGCG GCACATCAGT AGCTATGATT GAAAAGTTCT ATTCACATGT CACGCTTGAA TCGATACGAG ACAGACTGCG GCCTTCTGGA ACGCGGATCT AG
|
Protein sequence | MGKHSSSSVA GAFNSQPWSI YKQTNSKNWF VRFSIKGQGQ IRKSLQTADE KEADRKAARI YYEALLRAEQ GLEARDKTVR VLADEWIAEG VLKAPERTAL SRYIVGYFGD DKPSDITYRG LQAYRKWRLS YWTEGPGKNL EMVVYQRLGR QIRRPVTRKT PTRSTLNSEQ VVFKKFLTRC QNLGHIKTIP KFDKIEGEVH NRPGFSHAEI EKILSVLRKR TVLPALSNEE RYPHILLYGY VGIMCGSGMR PIECQKLRWI DLVGFDESRN AKLCEGRITV RVHGKGKSRE FVPLDGTISD FLMIWDVQKI IRGSDPDPND YIFVDIKGRH IQTFNPEVVS LLEECSLRQD YRGIKRTSYS FRHYYITFMI NAHANIYDIA KNCGTSVAMI EKFYSHVTLE SIRDRLRPSG TRI
|
| |