Gene Gdia_2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2072 
Symbol 
ID6975499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2298518 
End bp2299543 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content71% 
IMG OID643391602 
ProductMammalian cell entry related domain protein 
Protein accessionYP_002276447 
Protein GI209544218 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.435744 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTGCCATG CCAGGTTTCC CTTGGCGGAA CGGGTAATCA AAACAGACCG GATGACACGG 
CAACCGACAG CACTGGCCGT CCTGCTCTTT ATTTTGTGCG GGATCGGAAC GGGGATCGCG
ATCCTGGGAT CGTTCGGGCG GTTCGGCCTG CTGACCCGGA CCGAACGGGC GCTGGTCGTC
TTCGACACCC CGGTCCCGGG ACTGAGCGCC GGCGCGCCGG TCACATTCCG CGGCGTGGCG
CTGGGCCGGG TCGAACAGGT GAACGTCCTG CCCGATCCCG CGCGGGGCCG GACCATCATT
CCGGTCACGA TCCGCGTCCG GCCCGACCTG ATCCGCGTCA TCCCGCCACC CGGCACGTCC
CGGCCGCGCC GTATCGCCCT GGCCGACCTG GTGCGGGACG GATTGCAGGC GCACCTGCAT
TCCCAGAGCC TGGTTGTCGG ACGCAGCGGG ATCGACCTGG ATTTCGCCCC CGGACCGTCC
CCGCCGCCGC ATCCCGGCCT GTCACATCTG ATTGAAATCC CAGCCCGCGA ATCCCACTGG
CAGGTGCTGC GCCGCACTCT GGCCACCCTG CCGATCCACG CCATGGCGGC ACAATGGCAG
CAGGCACGGG CGGACGGCCG GAACATCGCC ACCCGGATGG ATGCCACCCT GCCACCCATG
CGCGCCGGCT TTCTGGACGT GCGCGACCGG GCACACGCCA CCGCTGCCGC CCTGAACCGG
GCGGAGACGC AGACCGGCCG CGCCTGGGCC GTCACCCACA CCGATATCGA TCACCTGCAG
GCAACCGCCC GGCGTCAGGT CCATGATCGG GGTGCGGACA TGTCCGCCGT CGCCCGGGGA
GCGCATGCCG TGATAGTGGA GGCGCGGCAG GTACAGGCCG ATCTGCGCGC GCTGGACGCC
GATACCGCGC GCACAGACCT GGCCACGACC GGGCGCGACA TCGCGGCCGC CGGCGCGGCG
CTGCATGACG CGGCCCGGAC CGTGCGTCGG ACGCCGGGGG TTCTGCTGGT CGGGGAGGGG
AAGTAG
 
Protein sequence
MCHARFPLAE RVIKTDRMTR QPTALAVLLF ILCGIGTGIA ILGSFGRFGL LTRTERALVV 
FDTPVPGLSA GAPVTFRGVA LGRVEQVNVL PDPARGRTII PVTIRVRPDL IRVIPPPGTS
RPRRIALADL VRDGLQAHLH SQSLVVGRSG IDLDFAPGPS PPPHPGLSHL IEIPARESHW
QVLRRTLATL PIHAMAAQWQ QARADGRNIA TRMDATLPPM RAGFLDVRDR AHATAAALNR
AETQTGRAWA VTHTDIDHLQ ATARRQVHDR GADMSAVARG AHAVIVEARQ VQADLRALDA
DTARTDLATT GRDIAAAGAA LHDAARTVRR TPGVLLVGEG K