Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_2072 |
Symbol | |
ID | 6975499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 2298518 |
End bp | 2299543 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643391602 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_002276447 |
Protein GI | 209544218 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.435744 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTGCCATG CCAGGTTTCC CTTGGCGGAA CGGGTAATCA AAACAGACCG GATGACACGG CAACCGACAG CACTGGCCGT CCTGCTCTTT ATTTTGTGCG GGATCGGAAC GGGGATCGCG ATCCTGGGAT CGTTCGGGCG GTTCGGCCTG CTGACCCGGA CCGAACGGGC GCTGGTCGTC TTCGACACCC CGGTCCCGGG ACTGAGCGCC GGCGCGCCGG TCACATTCCG CGGCGTGGCG CTGGGCCGGG TCGAACAGGT GAACGTCCTG CCCGATCCCG CGCGGGGCCG GACCATCATT CCGGTCACGA TCCGCGTCCG GCCCGACCTG ATCCGCGTCA TCCCGCCACC CGGCACGTCC CGGCCGCGCC GTATCGCCCT GGCCGACCTG GTGCGGGACG GATTGCAGGC GCACCTGCAT TCCCAGAGCC TGGTTGTCGG ACGCAGCGGG ATCGACCTGG ATTTCGCCCC CGGACCGTCC CCGCCGCCGC ATCCCGGCCT GTCACATCTG ATTGAAATCC CAGCCCGCGA ATCCCACTGG CAGGTGCTGC GCCGCACTCT GGCCACCCTG CCGATCCACG CCATGGCGGC ACAATGGCAG CAGGCACGGG CGGACGGCCG GAACATCGCC ACCCGGATGG ATGCCACCCT GCCACCCATG CGCGCCGGCT TTCTGGACGT GCGCGACCGG GCACACGCCA CCGCTGCCGC CCTGAACCGG GCGGAGACGC AGACCGGCCG CGCCTGGGCC GTCACCCACA CCGATATCGA TCACCTGCAG GCAACCGCCC GGCGTCAGGT CCATGATCGG GGTGCGGACA TGTCCGCCGT CGCCCGGGGA GCGCATGCCG TGATAGTGGA GGCGCGGCAG GTACAGGCCG ATCTGCGCGC GCTGGACGCC GATACCGCGC GCACAGACCT GGCCACGACC GGGCGCGACA TCGCGGCCGC CGGCGCGGCG CTGCATGACG CGGCCCGGAC CGTGCGTCGG ACGCCGGGGG TTCTGCTGGT CGGGGAGGGG AAGTAG
|
Protein sequence | MCHARFPLAE RVIKTDRMTR QPTALAVLLF ILCGIGTGIA ILGSFGRFGL LTRTERALVV FDTPVPGLSA GAPVTFRGVA LGRVEQVNVL PDPARGRTII PVTIRVRPDL IRVIPPPGTS RPRRIALADL VRDGLQAHLH SQSLVVGRSG IDLDFAPGPS PPPHPGLSHL IEIPARESHW QVLRRTLATL PIHAMAAQWQ QARADGRNIA TRMDATLPPM RAGFLDVRDR AHATAAALNR AETQTGRAWA VTHTDIDHLQ ATARRQVHDR GADMSAVARG AHAVIVEARQ VQADLRALDA DTARTDLATT GRDIAAAGAA LHDAARTVRR TPGVLLVGEG K
|
| |