Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1799 |
Symbol | |
ID | 6975220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 1993000 |
End bp | 1994595 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643391323 |
Product | integrase family protein |
Protein accession | YP_002276174 |
Protein GI | 209543945 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0370566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.65053 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCGCG CCATCCCGTC CCGACGCACA GGACAGCCCC GTTATCACTT CCGCCGTATC GTCCCGGCAA TGCTCCGCCC CCTGCTGGGC AAGACCGAGA TTTCCCTCGT CCTCCATACC TCTGATAAAC TGGTCGCCCG TGAACGTGCA GCCGCCCTGT ATGCCAGGAC GGGACAGCTT TTCAGAGCAG GCAAGATGCA AAACCCCTCC AGAGAGGACC TTCTGGTCCT CTACAGCGAA TTGATTGAAA ACTACGAAAT CGCTCTGAAG GCTCAGGAGG AGAGTGCCAG AAAAGTGCTG GAAGCCGAAC GGGCCCAGCA TCTCATTGAA AAAGCCGATC TGGTCTCACG ACAGTCGACC TTCCTCAACA AGGTCCAGCC CCACATGGAC AGCCTTCTTC AGGCACTGGG GCAACTGAAG CACACGCTCG ACAGACGGGA TGTCTGGAAC GTCGCTGAAA AACAGGGACT GCAGAACCAG ATCACCACCC TCAGCACCCT GATCCAGACC AGCCTGCATA GAGGTTCAGA GAACCCGGAA GAACAAACAC CTTCCCCCAA ACAGCCACGC AGCATGAAAC TGTCGGCGGC GGCTGAGAAA TTTGTCTTCT CGGTTTCCTC CAAAAGTGCC GGGACCATCA AGGGAACGGG CAAGACCGTA GCCCTCTTCA CCGAAGCCTT CGGCGATATG CCAGTTCATC AGGTAACCGG GGAAGTCGTG GGAGAATTTT ACGACCTACT GTCAGGCCTC CCGGCAACAC ATGGCAAAGG CAAGACCACT CTTCCCCTTC GGGACGCGGT CCGAGAGGCC CAAAAAAGCG AGGGAGAAAC CGTATCGGGC AAAACCGTCA AAAACCATTT CTCGCGCATG TCCGCGATCT GGAGTGAACT GGTCCGGCGT GACCACGCCC CCAAAAATCC ATGGGCCAAC TGGTCTTTCG ATCTGACGCA GAAAACCAAC CGCCGGGCCT GGTTGGAAGA GGAACTGAGA ACACTCCTGA ACTCGAAATG GCTGGGCCGG GTTTTCCCCG AACGCACCTA TCGGGGCATT GTGCGCATAG CCCTCTATAC CGGCATGCGT CTGGGTGAGA TCTGCAATCT GAGAAATCAG GATATCGAAA CCCTGAACGG CATTCCCTGT TTCCATATCC GGCCCCATAC AGTCGAGATC GACGGCAAAA CCCGGGAATG GTCTCCCAAG ACATCAGCCG GCACCCGTAT CATTCCCATC CACAGCAAAC TGCTGGAAAA AGGGATCATT GAGGAATTCA GAAATTCCGG CCCCTATCTC TTCAGCGAAC TGCCCATCTC CGCTTCCGGG GTCCGAGGCG CAAACTTCGA AATGGTCTTT TCCAAACATA AACGGCGTCT GAACCTGCCA GCGGACGTCA CCTTTCACTC CTTCCGCCAT CTGGTTTCGA CAGTGCTCAG AAACCAGGAC AGTCACATCC GGGAACTCTG GATTGATGAT CTGCTGGGTC ATGAAGCCAC CCACAAAAGT CAGGGCACAA CCCAATACAC GTCAGCTATT GATCTGCAGA ACCTCCAGCG GGTTGTGGAG GCCATTACCT ATCCCGACGA CATCGCAAAC TGGTGA
|
Protein sequence | MLRAIPSRRT GQPRYHFRRI VPAMLRPLLG KTEISLVLHT SDKLVARERA AALYARTGQL FRAGKMQNPS REDLLVLYSE LIENYEIALK AQEESARKVL EAERAQHLIE KADLVSRQST FLNKVQPHMD SLLQALGQLK HTLDRRDVWN VAEKQGLQNQ ITTLSTLIQT SLHRGSENPE EQTPSPKQPR SMKLSAAAEK FVFSVSSKSA GTIKGTGKTV ALFTEAFGDM PVHQVTGEVV GEFYDLLSGL PATHGKGKTT LPLRDAVREA QKSEGETVSG KTVKNHFSRM SAIWSELVRR DHAPKNPWAN WSFDLTQKTN RRAWLEEELR TLLNSKWLGR VFPERTYRGI VRIALYTGMR LGEICNLRNQ DIETLNGIPC FHIRPHTVEI DGKTREWSPK TSAGTRIIPI HSKLLEKGII EEFRNSGPYL FSELPISASG VRGANFEMVF SKHKRRLNLP ADVTFHSFRH LVSTVLRNQD SHIRELWIDD LLGHEATHKS QGTTQYTSAI DLQNLQRVVE AITYPDDIAN W
|
| |