Gene Information |
Locus tag | Gdia_3012 |
Symbol | |
ID | 6976446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 3297799 |
End bp | 3298869 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 643392520 |
Product | protein of unknown function DUF58 |
Protein accession | YP_002277357 |
Protein GI | 209545128 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
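The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of three and that the final
    codon is a stop codon, which does not contribute a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the record for Gdia_3012.
    length = gene_length_bp(3297799, 3298869)
    print(length, protein_length_aa(length))
```

Under these assumptions the values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields listed for this locus.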
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0180166 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.251774 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCGAT CCTCTCCCTC CCTCCCTGAC CACGGGCCTG ACGACCGGCC GGGCCGCGCG GCCCGGCTGC TGCGCCGCCT GCTGCGCCGG CCGCCCCCGA ACGGCCGCGC CACGGCGGGC GGGGACGCGG CCCTCGATAC GCCCGGCGCG GCCGTTCCCC TGCCCCTGGC GGCGGAAACG CTGGCCGCCC GCATGCCGGC CCTGATCCTC GCGGCCCAGC GCATCGCCGC GACCGTGGCG GTGGGCCACC ATGGCAGGCG GCAATCCGGC CCGGGCGAGG ATTTCTGGCA GTTCCGCCCC GCCCAGCCCG GCGAGCCGGT GACCCGGATC GACTGGCGGC AATCCGCGCG CAGCCTCCGC GCCTATGTGC GCGAGACCGA GGCCGAGGCC GCCCAGACGC TGTGCCTGTG GTGCGACCCC AGCGCGTCGA TGCGCTGGCG CTCGGGCGCG GCGCTGCCGC TGAAATCGGA CCGCGCGGTG CTGCTGGCCC TGGCGGTGGG CACGCTGGCG CTGCGCCAGG GGGAACGGGT GCGGGTGCTG GCCCCGGACG GCCCCATCGA CATCCCCCCC GGCGGACGGG CGGCCCTGGA CCGGCTGGCC GTGGCGCTGC TGCGGATCAT GGAGGGCGGA CCGGACAATC CCGGCCTGCC CAATCCGCAC CAGGTTCCCC GCCATGCAAG GGTCGTGCTG CTGGGCGACG GGCTGGGCGA GATCGCGCCG CTCGACGCCC TGCTGCGCGG CCTGGCCGCG CGCCCGGCGC GGGCGCACCT GCTGCTGGTC AACGACCCGG CGGAGGCCAG CCTGCCCTAT GCCGGGCGCG TCCGCTTCGC GGGGCTGGAG GACGAGGCCG CGATGACCCT GTCGGGCGTC GAAGGCCTGC GCGCCGCCTA TCGCGATGCC TATGCCCGCC ATCAGGACGA TCTGGCATCC GTGTGCCGCG CCACCGGCCT GGACCTGATC CGCCATGTCA CCGACCAGCG GCCGGAAACG GCGCTGCTGG CCCTGCACGC CGCCCTGATG GATCGGGGCG GCGCGGCCGG ACGGGCAGCG CGGGGGGGGC GCGGCCGATG A
|
Protein sequence | MTRSSPSLPD HGPDDRPGRA ARLLRRLLRR PPPNGRATAG GDAALDTPGA AVPLPLAAET LAARMPALIL AAQRIAATVA VGHHGRRQSG PGEDFWQFRP AQPGEPVTRI DWRQSARSLR AYVRETEAEA AQTLCLWCDP SASMRWRSGA ALPLKSDRAV LLALAVGTLA LRQGERVRVL APDGPIDIPP GGRAALDRLA VALLRIMEGG PDNPGLPNPH QVPRHARVVL LGDGLGEIAP LDALLRGLAA RPARAHLLLV NDPAEASLPY AGRVRFAGLE DEAAMTLSGV EGLRAAYRDA YARHQDDLAS VCRATGLDLI RHVTDQRPET ALLALHAALM DRGGAAGRAA RGGRGR
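The relationship between the gene sequence, translation table 11 and the protein sequence can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is shown 5' to 3' in coding orientation (it starts with GTG and ends with TGA even though the gene lies on the minus strand), that whitespace in the sequence is only display grouping, and that an initiating ATG, GTG or TTG is read as methionine. Table 11 uses the standard codon assignments for all other positions and allows further alternative start codons that this sketch ignores. The helper names are hypothetical.

```python
import itertools

# Standard codon assignments (shared by NCBI translation tables 1 and 11),
# listed in TCAG order for the first, second and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplification: only the most common bacterial initiator codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(dna: str) -> str:
    """Translate a complete CDS up to, and excluding, the first stop codon."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon at position 0 is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First ten codons of the gene sequence above.
    print(translate_cds("GTGACGCGAT CCTCTCCCTC CCTCCCTGAC"))
```

Passing the full Gene sequence field to `translate_cds` and comparing the result with the Protein sequence field (after removing the spaces) is a quick consistency check; `gc_fraction` can be compared with the GC content field in the same way.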