Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_3159 |
Symbol | |
ID | 6976597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 3456296 |
End bp | 3457633 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643392671 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002277504 |
Protein GI | 209545275 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.248716 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATTCTTC TGGCCGGCCT GATCGCCATC ATCGCGCTGA TCGCCGTCAG CATCCTGATC TCGCTGTCGG AAATCTCGTT CGCCGCCGCG CGTGATGTCA GGCTGCGCAC CCGGGCCGAG GCCGGGGACA CCAGGGCGGT GAATTTCCTG CGCCTGCGCC GCAACAGCGG GCAGGTCATC ACGGTGCTGC AGATCTGCCT GAACGCCGTC GGCGTGCTGG GCGGCGTCAT CAGCAGCGAA CTGATCACCC CGCCCCTGGC GCTGATCCTG CATTCGGCCG GCCTGGCGGC GGGGCTGGCG GCGGACGTGG CCTCCACCGC GTCCTTCCTG CTGGTGACGG GCCTGTTCGT GCTGTTCGCC GACCTGCTGC CCAAGCGGAT CGCGATGAAC GCGCCGGACC GCGTGGCGCT GGCCATCGGC TGGTTCCCCG CCATCGCCCT GCGGGTGCTG TTTCCCGCCG TCTGGATCTT CTCGAAAATC TCCGACGTCA TCCTGCGCGC GCTGAGGATC CCGGCGGCCT CGGCCGTGGA ACCGGTCACG CCCGAGGACC TGCGCGCGAT CCTGGCCGCC GGCACCGCAT CGGGAATCCT GCTGGAGCAG GAACACCAGA TGATCCAGAA CGTCCTGGGG TTGCAGGACC GGTCCGTGAC CTCGGCCATG ACCCCGCGCG ACGAAATCGT CTTCCTCGAC GTGCAGGAAA GCACCGAGAG CCAGCGCGAC AAGGTCCGCG CCAAGCCCTA TTCGCGCTAT CCGCTGTGCA ATGGCGGGCT GGACAACGTC ATCGGCTCGA TCCGCGCCGA GGACGTGCTG GCCGCCGTGG TCGAGGACGC GCCGACCGCC ACCCCCGGCA CGGTCCTGCC GGCCGCGCGC CAGATCTCGC GCATGCGGCG CGACGTGCTG TCGCTGCCCG ATACGCTGAA TCTGTGGGAC ACGCTGGCCC AGTTCGACAC GCACGGCGCG GGCTTCGCGC TGATCGTGAA CGAATACGGG CTGGTGGTCG GGCTGATCAC GTTCAAGGAC ATCATGGGCG CGCTGATGGA CGGGCTGGCC AACCCGTTCG AGGAACAGGC CATCGTGCGC CGCGACGAAA ATTCGTGGCT GATCGACGGC GCGGCCCCGA TCGGCGACGT GATCCGCGAA CTCGGGATCG CGACGCTGCC CGACAGCCAC AATTTCGATA CGATCGGCGG CTTCATCATG CACCGCCTGC GCCGCATGGC CCGCAAGGCC GACCGCGTCG AGGCCGCGAA CTTCCTGTTC GAAGTGGTCG ACGTCGAAGG CTTCCGCATC AATCAGCTCC TGGTCACGCG GCGCCCGAGG CGCGCCGAGG CCGACTGA
|
Protein sequence | MILLAGLIAI IALIAVSILI SLSEISFAAA RDVRLRTRAE AGDTRAVNFL RLRRNSGQVI TVLQICLNAV GVLGGVISSE LITPPLALIL HSAGLAAGLA ADVASTASFL LVTGLFVLFA DLLPKRIAMN APDRVALAIG WFPAIALRVL FPAVWIFSKI SDVILRALRI PAASAVEPVT PEDLRAILAA GTASGILLEQ EHQMIQNVLG LQDRSVTSAM TPRDEIVFLD VQESTESQRD KVRAKPYSRY PLCNGGLDNV IGSIRAEDVL AAVVEDAPTA TPGTVLPAAR QISRMRRDVL SLPDTLNLWD TLAQFDTHGA GFALIVNEYG LVVGLITFKD IMGALMDGLA NPFEEQAIVR RDENSWLIDG AAPIGDVIRE LGIATLPDSH NFDTIGGFIM HRLRRMARKA DRVEAANFLF EVVDVEGFRI NQLLVTRRPR RAEAD
|
| |