Gene Information |
Locus tag | Gdia_1438 |
Symbol | |
ID | 6974847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 1602540 |
End bp | 1604093 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643390969 |
Product | Sel1 domain protein repeat-containing protein |
Protein accession | YP_002275833 |
Protein GI | 209543604 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
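The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length_bp(1602540, 1604093)
print(length, protein_length_aa(length))  # → 1554 517
```

Both results agree with the Gene Length (1554 bp) and Protein Length (517 aa) fields.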
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.262723 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.636598 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTTTCTT TTCGAAACAT TTTCGACAGA AACACGTACT TGCCGGGAAA CCGATTGTAC GGAAAAGGAA AGCGCTACCT GGCATCGGAT CAAATTGCGG ATTGGGTAAA AGCCGCGAAG TGCTTCTTAG GAGCTGCGAA CAGAGGCAGA CCGGAAGCCC AGTATCAGCT TGGACTCTGC TATCTCGAAG GCAAAGGCGT GCCGCGCAAC CGAGCACAGG CTTTATGCAG GCTGACTGCA GCGGCCGATG CCGGAATTGC CGAGGCTATG GCCAAGCTTT CCTTTCTGTA TCTCGAAGGG CTTCCCGACT TCGAGGATGC TTCGCATGTT GCTCGACTGG TTAAAGATGA GCGGCATGGC AGAAACAATG TTGACACTGA CAAGGCTCTT TATTGGGCAA ATAAGGCGAT CGCGGGCGGC GTGGCAGAAG GGCACGTCTG CCTTGGTTAT ATCTATTCGT TAGAGAGGTC AGCCTATAAG GATATCGATA TGGCCATAAA GCATTATAGG GCTGCGATGG AGGCGGGAAG CCGCAAGGCA GGACTTGGTC TTGGACAAAC GCTCCTACGG TTTCGGAGTG CGGGCCTTCC TGCTCTCAAG GAGGCGGAGA AGGCGCTTCT TTTTGCCATG GAGGAGAAAA GTCCCGTTGC CGCATATCTT CTGGGTGGAT TGTACGAAGT CGGATCCGGA GTGGAAAAAG ACCTGTCCAG AAGCCGGGAA TTGTACCGGC AGGCCGCCGA AGGCGGCGTG ATCAAGGCCA TGATGCGCCT GAGTACTTTT CTGCTCGACG GCCATGGTGG TCCCCCCAAT CGAACAGCAG GGGAAACCTG GTTGAGACGC GCAGCACTCA AGGGAGACTG GAGCGCCTGC CGTCTTCTGG GCGAAATGTA TGCCGCCAGA CACCATGCAT ATGAAATGAG GCGATGGTAT GAAATAGGAG CCGGACATGG CGATCCGGTT TCGATCTTCA GGCTGGGAGA GGCCATCGAG CGTCTTCAGG AGCCGGGCGG AACGTCCATC GAGGCCGTAG AGTGTTATCT CCTGGCTCTG CAACATGGAT ATGAACCGGC CGTGCGCAAG CTTGCCGCGA GCGTACGGAA TCCGGGCGAA TTCCAGGCTC GCGTCCTCGC GAAGCTCAAC GCCCTATGCG CCCAGGATAT TCCGTTGGCG TATCTTGCCC TGGCGCTGTG CATAAGATCC TTGAAGGGGG GTGACATGGT CATGGCCAGA GAACTGTTCA TCAAGGCTTC TAAGGGGGGA ATCGTGACGG CTCACGTCGC TGCGGGCCAA ATGATCATGA ACGGTCTTGG AGGCAAGGCG GACGCCGGCC TCGCTCGGGA GTTTTTTTTC AAAGGCGCAA AGGCCGGTCA CATCGGCGCC ATGTATGCAC TCGGGAGTTT TCATCACAAG CGAGGCGCAA TCGGGTCGAA TATGGCACAG GCCGCGCTCT GGTATCGCCG AGCGGCATCT GGTGGCCATA AGCATGCGCG GCGCATACTT AGCCGGTCAT CGCCTTGTGG CACGGAGGAG GCGCCGTTAC GGGAGCTTAC ATGA
|
Protein sequence | MVSFRNIFDR NTYLPGNRLY GKGKRYLASD QIADWVKAAK CFLGAANRGR PEAQYQLGLC YLEGKGVPRN RAQALCRLTA AADAGIAEAM AKLSFLYLEG LPDFEDASHV ARLVKDERHG RNNVDTDKAL YWANKAIAGG VAEGHVCLGY IYSLERSAYK DIDMAIKHYR AAMEAGSRKA GLGLGQTLLR FRSAGLPALK EAEKALLFAM EEKSPVAAYL LGGLYEVGSG VEKDLSRSRE LYRQAAEGGV IKAMMRLSTF LLDGHGGPPN RTAGETWLRR AALKGDWSAC RLLGEMYAAR HHAYEMRRWY EIGAGHGDPV SIFRLGEAIE RLQEPGGTSI EAVECYLLAL QHGYEPAVRK LAASVRNPGE FQARVLAKLN ALCAQDIPLA YLALALCIRS LKGGDMVMAR ELFIKASKGG IVTAHVAAGQ MIMNGLGGKA DAGLAREFFF KGAKAGHIGA MYALGSFHHK RGAIGSNMAQ AALWYRRAAS GGHKHARRIL SRSSPCGTEE APLRELT
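The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It makes several assumptions: the gene sequence as displayed is already in coding orientation (it begins with GTG and ends with TGA, even though the record lists the minus strand); the amino acid assignments of translation table 11 match the standard code; and an initiating GTG or TTG is read as methionine. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical and chosen for illustration only.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Common bacterial initiator codons (assumption: a subset is enough for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in first position is read as methionine.
        if index == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGGTTTCTT TTCGAAACAT TTTCGACAGA"))  # → MVSFRNIFDR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should give the complete protein, under the assumptions stated above.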