Gene Information |
Locus tag | Gdia_0034 |
Symbol | |
ID | 6973423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 39913 |
End bp | 41481 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643389567 |
Product | SpoVR family protein |
Protein accession | YP_002274451 |
Protein GI | 209542222 |
COG category | [S] Function unknown |
COG ID | [COG2719] Uncharacterized conserved protein |
TIGRFAM ID | |
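The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 39913
end_bp = 41481

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases. Assumption: the last codon is the
# stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1569 0 522
```

Under these assumptions the result agrees with the Gene Length (1569 bp) and Protein Length (522 aa) rows.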
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0748204 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.0139145 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGA TCACGCCCAA AGGCGGTGGG GATGGGGGCG GCGCCCGGCC GGGCGGCCTG CTCTATTCCG GCAATGACTG GAACTTCCAG ATCCTCCGCG ATTGCTACGA TGCGATCGCC GAGATCGCGG ACAAGGAACT GGGGCTGGAA CTCTATGCCA ACCGGATCGA GATCATCACG TCCGAACAGA TGCTGGACGT CTATACCTCG CACGGGATGC CGCTGGGGTA CAAGCACTGG TCGTTCGGCA AGCGCTTCAT CGGGCATGAA AACGCCTATC GCCGCGGACT GATGGGCCTG GCCTACGAGG TCGTCATCAA TTCCGATCCC TGCATCAACT ATCTGATGGA GGAAAATTCG GCGACGATGC AGGCGCTGGT CATCGCCCAC GCCGCGTTCG GCCATAACCA TTTTTTCCGC AACAACCGGC TGTTCCGCGA ATGGACGGAC CCGTCGGAGA TTTTGGACTA CCTGGAATTC GCCCGCGGCT TCATCGCGCG GTGCGAGGAA CGGCACGGCG TGCGCGCGGT GGAGCGAATC CTGGATGCCG CCCACGCCCT GCAGAACCAG GGCGTCCACC GTCATTCGGG CGCCCGCAAG CTGGATCTGA AGGCCGAGCA GCAGCGCGCG CGCGAACGGC GCGCCTATGA AGACAGCATG TTCAACGATC TGTGGCGCAC CCTGCCCACC GAACCGGCCG GCGAGGAAGG GCAGGCCGAG GGCGCCCTGG CACGGCGCCT GCTGGGCCTG CCCGAGGAAA ACCTGCTCTA TTTCCTGGAA AAGAACGCCC CCCGCCTGGC GTCGTGGGAG CGCGAGATCA TCCGGATCGT GCGGATGGTC GCGCAATATT TCTACCCGCA GCCCCAGGTG AAGATGATGA ACGAGGGCTG CGCCACCTGG GTGCATTCCT ACATCATGCG CCGGCTGCAT GAACTGGGCC GGATCGACGA CGCGGCGTAT CTGGAAGTCA TCCATTCCAC ATCGAATGTG ATCAGCCAGC CCGGTTTCGA TGCCGGCGGC GGACCGTCCT TCAATCCCTA CGCGCTGGGC TATGCGATGA TGACCGACAT CGCCCGGATC TGCGAGACAC CCACCGAGGA GGACCGGACC TGGTTCCCCG ATATCGCCGG CAACGGCGAC CCGATCGGCA CCCTGCGGCA TGCCTGGGCG GAATACCGGG ATGAAAGCTT CATCCAGCAA TTCCTTTCCC CCAAGGTGAT CCGGGATTTC CGCATGTTCC GCCTGCGCGA CGACACCAGC CAGCCCTACC TGCTGGTCGA CGCGATCCAT GACGAGGCCG GATATCGCGA CATCCGCCGC AGCGTGGCGC TGACCTACGA TCCCGGGACG TTCTATACCG AAATCGAGAT CGTGGATGTG GACCTGCTGG GCGACCGCAC CCTGGTGCTG GAACATCGCA GCCGCACCGG CCAGATGCTC CAGCCCGGCG ATGCGCGGCA GACGCTCGAT TATCTTGCAT TATTATGGGG TTATGGCGTC ATCCTGAAGG AAATCGACAG CCAGACCGGA ACCGTCGTCA CCACCCATTC GGCCAAACCA TCCGCATAA
|
Protein sequence | MNQITPKGGG DGGGARPGGL LYSGNDWNFQ ILRDCYDAIA EIADKELGLE LYANRIEIIT SEQMLDVYTS HGMPLGYKHW SFGKRFIGHE NAYRRGLMGL AYEVVINSDP CINYLMEENS ATMQALVIAH AAFGHNHFFR NNRLFREWTD PSEILDYLEF ARGFIARCEE RHGVRAVERI LDAAHALQNQ GVHRHSGARK LDLKAEQQRA RERRAYEDSM FNDLWRTLPT EPAGEEGQAE GALARRLLGL PEENLLYFLE KNAPRLASWE REIIRIVRMV AQYFYPQPQV KMMNEGCATW VHSYIMRRLH ELGRIDDAAY LEVIHSTSNV ISQPGFDAGG GPSFNPYALG YAMMTDIARI CETPTEEDRT WFPDIAGNGD PIGTLRHAWA EYRDESFIQQ FLSPKVIRDF RMFRLRDDTS QPYLLVDAIH DEAGYRDIRR SVALTYDPGT FYTEIEIVDV DLLGDRTLVL EHRSRTGQML QPGDARQTLD YLALLWGYGV ILKEIDSQTG TVVTTHSAKP SA
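The Gene sequence and Protein sequence rows are related by translation table 11, which assigns amino acids to codons in the same way as the standard genetic code. The sketch below is one way to check that relationship and to compute a GC fraction. It assumes the sequence is pasted as shown here, in space-separated blocks of ten bases, and contains only A, C, G and T. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers written for this illustration, not part of any database tool.

```python
# Codon table laid out in the conventional T, C, A, G order.
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Any character other than A, C, G or T raises ValueError.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last nine bases of the Gene sequence row.
head = "ATGAACCAGA TCACGCCCAA AGGCGGTGGG"
tail = "TCCGCATAA"

print(translate(head))  # MNQITPKGGG
print(translate(tail))  # SA
```

The two fragments translate to the first ten and last two residues of the Protein sequence row. Passing the complete Gene sequence value to `translate` and `gc_fraction` would, under the same assumptions, be expected to reproduce the full protein and a GC fraction close to the 63% listed in the Gene Information table.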