Gene Information |
Locus tag | Gdia_1323 |
Symbol | |
ID | 6974730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 1475755 |
End bp | 1477440 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643390854 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_002275720 |
Protein GI | 209543491 |
COG category | [R] General function prediction only |
COG ID | [COG3008] Paraquat-inducible protein B |
TIGRFAM ID | |
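
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not appear in the protein; both are assumptions about this table's conventions, not something the record states. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1475755
end_bp = 1477440

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the remainder should be zero for an unspliced,
# non-pseudo gene.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1686 0 561
```

Under these assumptions the result matches the listed Gene Length (1686 bp) and Protein Length (561 aa).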
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.404082 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.550075 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAGACG ACCCCCAAGA TTGTCCCGGT GGCCGGTCTT CCGCCCCGCC CGAGGCTTCG GCACGGAAAT ACCGCTTTTC GATCGTCTGG CTGGTGCCCA TCGTGGCGCT GGGCATCGCC GGCTATCTGG GCTGGCGCGG CTTCATGGGC CGCGGGCCGG AAATCACCAT CACCTTCGAT ACCGCCGACG GACTGACCAG CGGCCAGACC CAGGTCAAGA ACAAGGCGGT GCCGCTGGGC ACGGTCCAGG ACGTGGCGCT GACCCCCGAC ATGCGCCATG TCGAGGTGCG CGTGCGCATG AGCGCCAAGT CCGACCCCAT GCTGACCGAC CACGCCCGGT TCTGGGTGGT GCGGCCGCGC CTGAACGGCG CCAGCGTCAC GGGGCTGGAG ACGCTGATGA CCGGCGCCTA TATCGCGATG GACCCGGGAA CGCCCGGCGG CAAGGCCACG ACGCGGTTCA ACGGGCTGGA ATCCCCGCCG GGCCTGAGGT CCGACCAGCC GGGCAACACT TACACGCTCA TCAGCCCGTC CCTGGGCTCG ATCGGGCAGG GCGCGCCGGT CTTCTTCCGC GATATCGATG TGGGCGAGGT GCTGGGATAC ACCATGCCGC CGGGCGGCGT GGGACCGATC CTGATCCAGG TCTTCATCCG CGCGCCCTAT GACAGCTACC TGCGGACCGA TACGCGCTTC TGGAACGTGT CGGGCGTGCA GGTCGGCTTC GGGGCCGGCG GCCTGAAGGT CAAGCTGCAA TCGATCCAGG CCCTGTTCTC GGGCGGCGTC GCGTTCGGCC TGGCGCCGCA GCGGGTCGAC CAGCCGGTGC CCTCGGCGCC CCGGAATTCG GTCTTCCGTC TCTATGAAAG CCAGGAAGCG GCGGACAATG CCGGCTATCG CGAACGGCTG TCCCTGGCGA CCTACCTGAC CAATTCGGTG TCCGGCCTGG CGGTCGGGGC GCAGGTCACG ATGTTCGGCA TCCAGGTCGG CACCGTGACC AGCGTGAAGC TGGACCTGGA CCAGAAGGCC GGGACGGCCC GGGTGCGGGT GGGCATGGAA ATCCAGCCCG AACGGATTCT GCCGACCGAC CAGATCCATC ACGACACGAT GGCCGCCACC GTGCAGGCGC TGGTCGATAA CGGGCTGCGG GCCTCGGTCG ATACGGCCAG CCTGCTGACC GGCGAATCGG TGATCGGCCT GAATTTCGTC AAGAACGCGA CCCCGGCCAT GGTGCAGGCC GAGGGCACGA CCCTGATCAT CCCCAACAAG GCGGGCGGGA TCAGCGGCAT CATGGATTCG CTGTCCACTG TCGCGGACAA GATCGCCGCG ATGCCGCTGA CCCAGGTCGG CGTGAACCTG AACAACCTGC TGGCGCATTC CGACGCACGG ATCAACAGCC CCGAGGTGCG CCAGGCGATC GTGGCGCTGC GCGATTCGCT GCACAGCATC CAGGGCCTGG CCGGCGATGC GCGCAGCGGA ATGCATCCGC TGTTCCAGCG CCTGCCGCAG ATGAGCAAGC AGTTGGACGG CACGCTGAAG AACGCGAACG TGCTGATGGC CAGCTATGGC GGCGACACGG ACTTCCATCG GGACCTGCAG CAGATGGTGG TGCAGTTGAA CGAGGCGGCG CGGTCGCTGC GCTTCCTGAC CGATTTCCTC AATCGCCATC CTTCGGCGCT GATTACGGGA CGCTAG
Protein sequence | MTDDPQDCPG GRSSAPPEAS ARKYRFSIVW LVPIVALGIA GYLGWRGFMG RGPEITITFD TADGLTSGQT QVKNKAVPLG TVQDVALTPD MRHVEVRVRM SAKSDPMLTD HARFWVVRPR LNGASVTGLE TLMTGAYIAM DPGTPGGKAT TRFNGLESPP GLRSDQPGNT YTLISPSLGS IGQGAPVFFR DIDVGEVLGY TMPPGGVGPI LIQVFIRAPY DSYLRTDTRF WNVSGVQVGF GAGGLKVKLQ SIQALFSGGV AFGLAPQRVD QPVPSAPRNS VFRLYESQEA ADNAGYRERL SLATYLTNSV SGLAVGAQVT MFGIQVGTVT SVKLDLDQKA GTARVRVGME IQPERILPTD QIHHDTMAAT VQALVDNGLR ASVDTASLLT GESVIGLNFV KNATPAMVQA EGTTLIIPNK AGGISGIMDS LSTVADKIAA MPLTQVGVNL NNLLAHSDAR INSPEVRQAI VALRDSLHSI QGLAGDARSG MHPLFQRLPQ MSKQLDGTLK NANVLMASYG GDTDFHRDLQ QMVVQLNEAA RSLRFLTDFL NRHPSALITG R
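
The gene sequence as shown starts with GTG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. The following minimal sketch shows how the space-separated 10-base blocks can be joined and translated with translation table 11, where GTG is one of the permitted start codons and is read as methionine when it opens the CDS. The helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical and written for this illustration; only the first 30 bases and the last 6 bases of the record are used, to keep the example short.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G; third
# position varies fastest). Table 11 uses these same assignments and differs
# from the standard table only in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons of translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in first position is read as methionine.
        if index == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_30 = "GTGACAGACG ACCCCCAAGA TTGTCCCGGT"  # start of the gene sequence
last_6 = "CGCTAG"                              # end of the gene sequence

print(translate_cds(first_30))       # MTDDPQDCPG
print(translate_cds("GTG" + last_6)) # MR
```

The first call reproduces the first ten residues of the listed protein sequence, and the second shows that the final codon TAG terminates translation after the closing arginine. Applying `gc_fraction` to the full gene sequence would give a value to compare against the listed GC content; no result is quoted here because the sketch does not include the full sequence.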