Gene Gdia_2948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2948 
Symbol 
ID6976382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3224672 
End bp3227980 
Gene Length3309 bp 
Protein Length1102 aa 
Translation table11 
GC content74% 
IMG OID643392457 
Producthypothetical protein 
Protein accessionYP_002277294 
Protein GI209545065 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.323193 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCCG GTCAAGACCC GTCTGCCCCC GATCGGCCCC GGCTTCGGCG TCCATACCGG 
CGCATCCTGC TGTCGCTCGC GGCCGGCTGT GCCGCCCTGC CGGTCATCGC CGTCGCCGGA
CTGGCGGTGC GGCTTTCGCT GGGGCCGATG GACGTCACCC TGCCGGCCCG GCTGGTCCTG
CCGGTTCCGG TCCTGTCGGG CGGCCATGGC CGCCCGGCGG CGCTGCGGCT GGACATCGGG
CAGGTGCGCG TCGGCTGGGA CGTGCTGCGT GGCGGGCTGA CGGCGCCGCT GGATGTGCGG
GCGGCCGATA TGCGCCTGGT GCGGCCCGAC GGCGTGGTGG CGGACCGGAT CGGCACGGCG
CGGATGGTGC TGGCCGGCGC GCCGCTGCTG CATGGCCGCG TGACGCCCCG GGTGCTGGAA
ATCTCGGCGG TGCGGCTGGC GCTGCGCCGG GCGGCGAACG GTTCGGTCGG GCTGGATACC
GGGCCGTTCC CGGTCGTCGA TAACGGCGCC GCGCCGCTGC CGGTCGATCC CGCGCAGCTT
GACCGGCTGG TGGTGGGGGA CGCGGCCGTG ACCCTGCGCG ACGTGCCCAC GGGCGGGACG
TGGCAGGGGC AGGACCTGGC GGCCGACCTG CGGGCCGCGC GGCAGGGCGG CGCGTTCGGC
GGCACCGTGG GGCTGGTCGG CCGGGCGGGC GTGACCCTGT CCGGGCCCGG CGTGCATGCC
ATGCTGCGGG CCGAGGGACA GCGCGCGGAC GCTGCCGTGT CATGGCATGT ATCCGGCACG
AAGGTGGCGA TGCAGGCGCT GTCGCCCTTC GCGCCGGCGC TGGCGCGTGT CGCCCTGCCG
GTCGGGCTGG ACGGCACGGT GCGCCTGGTC GCCGACCGGT TGGGCCTGCT GGCGCGCCCC
GATACCGTCG CCCTGCGGGT GCAGGCCGGG GCGGGGCGCA TCGCCACCGG GCGCGGGGGC
GTGGTGGTGC TGGCCGGCGC GCAGGCGGAC CTCTCGGCCC GGTTCGGTCC CGCGGCCGAT
GGCGGGGACG TGCCGGGCGC GGTCCATGTG CGGCTGGACG GGCTTCAGGC CAGCCTGCTG
CCCTCGGATC ACCCGGATGC GCCCCCGGAT AACCTGGGTC CCGGCGGCCC GGTGCTGCGG
GCCAGCGGCG CGCTGGACAT CGACAGCCTG GCGCGGCCGG GCCGGATCGG CGTGACGCTG
GCCGGCGATA TTCCGCTGCT GGATTTCGCG ACCCTGGGCG CCTACTGGCC GGCCGGGGCG
GCCCGGGGGG CCAGGACCTG GGTCACGCGA AACATCACGG CCGGCATGGC GCACGACCTG
CACGTCACGG CGGGGCTGGC CAGCACGGCG GGCTGGGGGG CGATGCGCCT GCTGACGCTG
GGCGGCGGGG TCGCGGGGTC GGACCTGGAC CTGCACTGGC TGCGACCGAT CCCGCCGATC
CGGGGCGTGG ACGCCACCCT GACCTTCGAC GGGCCGGATG CGCTGTCCAT CGCCTTTACC
CACGGGGTCC AGACGGTGGA CCGCACCGGG CGCAATGTCG ACGCGACCGG CACCGGGCGG
ATCGCGGTCG GCGACGGGCA CATGCGCATC ACCGGCCTGA TGGTGAAGGA CCAGATGGGC
GACATCTCCA CCACGCTGCA CGGCAACGTG CGGGACGTGC TGGCGCTGCT GGCCGAACCC
CGGCTGAACC TGCTGTCGCG CCATCCGCTG TCGTTCAGCC ATCCGTCGGG CCGGGCCGAC
CTCGCGTTGC ACCTGGTCCT GCCGCTGAAC GCCCATGTCC GGGTGGACGA ACTGCATCTG
GACGGCCATG CCGACCTGGC CCAGGTCCAT CTGGGCAACG TGGTGCTGGG CCGCGCGCTG
GAAGGCGGGC GGCTGGCCAT CGACGCCACC ACCGACGGGC TGGGCCTGCG GGGGACCGGC
GTGCTGGGCG GCGTGCCCTC CACACTGCGC TACGACATGG ATTTCCGCAG CGATCCCGGC
ATCCGGGTGC GCGAGACGGC GCAGCTCCGC GCCCATGTGA CGCCCGAGGT CGCGGAGCGC
GCCGGATTCG CGGTCGCCCA GCGCTTCAGC GGCGCCGCCG ACCTGGATGT CGGCTATGAC
CGCTATGCGG ACGGGACCGG ACAGGTCCGG CTGGACCTGG ACCTGGACGA CGCGGCGCTG
ACCATCCCGG TCTGGAGCAA GGCGCGCGGC CAGGCGGCGC GGGCATCGGC GCGGATCGGG
CTGGCGGACG GGCGCCTGGC CTCGGTCGAG GCCATTCACG CCGCCGGCCC CGACCTGCTG
ATCGACGGCC GGGCGAACGT GGCGGGCGGC ACGGCGCGCA ACCTGATCCT GCGCGGGTTC
CGGGTCGGCC GATCGCGGGG CGACGCCACG ATCGGCGTGC CCGCGGGGCC GCGCGATGCG
GTGCGGGTGG CGATCGACGC CCCGGTGCTG GACCTCTCGC CGCTGCTGGC GCCCGATCCG
GCAGCCGATC ACGGCACCGA CCCCGTCCCG CAGGGACGGG CGGCGGCGGG CTATCACCTG
CCCGTGGCCG CGTCGGGCCG CGTCCATGGG CCGCCGGGGC GAAGCTGGCT GATCGATGCC
AGCGTCCGCA CCCTGTTATA TGCCAAGCAG GCCGCCCTGA CCGGCGTGCA CGCGCATCTG
GAGGATAACG GCGTCCGCCT GACGCGGATG CGCTTCGCCA TGGCCGGCCC GTCGCCCGCC
TCCGCGATCC TGACGCCCGA GTCCGACGGG CGGCATCTGT GGGCCAGCGT CCAGGATCTG
GGCCTGATGC TGCGGGGGCT GGATGTCACG ACGCAGTTCG AAGGCGGCCG GACGGTGCTG
CAGGGCGTCT TCGACGACCG CCAGCCCAGC GCGCCCTTCG CCGGGGTGCT GACGATCGAC
CCGATGACGC TGCACAAGGC CCCGGGGGCG GTGCGGCTGG CCAACGACGC CTCGATCTAT
GGCTGGATGC AGGCGCCAAA GGGGCCGGAT TTCCTGATCC AGCGCGTGTC CCTGCCGCTG
ACCTTTCGGG ACGGCACGCT GCACATCCAT GACGGGGTGC TCAACAATGC CTCGCTGGGC
GTGACGCTGG AAGGGCCGCT GGACCTGGAT CACGGACGGA TGGACCTGCG CGGCACGATC
GTGCCGGCCT TCGCGGTGAA CACCATTCCC GGCCACATGC CCGGGGTCGG CCGGCTGATG
AGCCCGGAAA AGGGCGGCGG CCTGCTGGCC GCGACCTTCG TCGTCAGCGG GGCGATGAAT
GCCCCGGCGC TGAAGGTCAA TCCGTTCTCG ATCTTCCTGC CGGGCGTGCT GCGGCGGCTG
GTGCAGTAG
 
Protein sequence
MTPGQDPSAP DRPRLRRPYR RILLSLAAGC AALPVIAVAG LAVRLSLGPM DVTLPARLVL 
PVPVLSGGHG RPAALRLDIG QVRVGWDVLR GGLTAPLDVR AADMRLVRPD GVVADRIGTA
RMVLAGAPLL HGRVTPRVLE ISAVRLALRR AANGSVGLDT GPFPVVDNGA APLPVDPAQL
DRLVVGDAAV TLRDVPTGGT WQGQDLAADL RAARQGGAFG GTVGLVGRAG VTLSGPGVHA
MLRAEGQRAD AAVSWHVSGT KVAMQALSPF APALARVALP VGLDGTVRLV ADRLGLLARP
DTVALRVQAG AGRIATGRGG VVVLAGAQAD LSARFGPAAD GGDVPGAVHV RLDGLQASLL
PSDHPDAPPD NLGPGGPVLR ASGALDIDSL ARPGRIGVTL AGDIPLLDFA TLGAYWPAGA
ARGARTWVTR NITAGMAHDL HVTAGLASTA GWGAMRLLTL GGGVAGSDLD LHWLRPIPPI
RGVDATLTFD GPDALSIAFT HGVQTVDRTG RNVDATGTGR IAVGDGHMRI TGLMVKDQMG
DISTTLHGNV RDVLALLAEP RLNLLSRHPL SFSHPSGRAD LALHLVLPLN AHVRVDELHL
DGHADLAQVH LGNVVLGRAL EGGRLAIDAT TDGLGLRGTG VLGGVPSTLR YDMDFRSDPG
IRVRETAQLR AHVTPEVAER AGFAVAQRFS GAADLDVGYD RYADGTGQVR LDLDLDDAAL
TIPVWSKARG QAARASARIG LADGRLASVE AIHAAGPDLL IDGRANVAGG TARNLILRGF
RVGRSRGDAT IGVPAGPRDA VRVAIDAPVL DLSPLLAPDP AADHGTDPVP QGRAAAGYHL
PVAASGRVHG PPGRSWLIDA SVRTLLYAKQ AALTGVHAHL EDNGVRLTRM RFAMAGPSPA
SAILTPESDG RHLWASVQDL GLMLRGLDVT TQFEGGRTVL QGVFDDRQPS APFAGVLTID
PMTLHKAPGA VRLANDASIY GWMQAPKGPD FLIQRVSLPL TFRDGTLHIH DGVLNNASLG
VTLEGPLDLD HGRMDLRGTI VPAFAVNTIP GHMPGVGRLM SPEKGGGLLA ATFVVSGAMN
APALKVNPFS IFLPGVLRRL VQ