Gene Gdia_1323 details

Gene Information

Locus tag: Gdia_1323
Symbol:
ID: 6974730
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1475755
End bp: 1477440
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 67%
IMG OID: 643390854
Product: Mammalian cell entry related domain protein
Protein accession: YP_002275720
Protein GI: 209543491
COG category: [R] General function prediction only
COG ID: [COG3008] Paraquat-inducible protein B
TIGRFAM ID:
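
The start, end, and length values in this record can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that does not contribute a residue. Neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 1475755
end_bp = 1477440
gene_length_bp = 1686
protein_length_aa = 561

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)
print(expected_protein_length == protein_length_aa)
```

Under these assumptions the span equals the listed gene length, and 562 codons minus one stop codon gives the listed 561 residues.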


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.404082
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.550075
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGACG ACCCCCAAGA TTGTCCCGGT GGCCGGTCTT CCGCCCCGCC CGAGGCTTCG 
GCACGGAAAT ACCGCTTTTC GATCGTCTGG CTGGTGCCCA TCGTGGCGCT GGGCATCGCC
GGCTATCTGG GCTGGCGCGG CTTCATGGGC CGCGGGCCGG AAATCACCAT CACCTTCGAT
ACCGCCGACG GACTGACCAG CGGCCAGACC CAGGTCAAGA ACAAGGCGGT GCCGCTGGGC
ACGGTCCAGG ACGTGGCGCT GACCCCCGAC ATGCGCCATG TCGAGGTGCG CGTGCGCATG
AGCGCCAAGT CCGACCCCAT GCTGACCGAC CACGCCCGGT TCTGGGTGGT GCGGCCGCGC
CTGAACGGCG CCAGCGTCAC GGGGCTGGAG ACGCTGATGA CCGGCGCCTA TATCGCGATG
GACCCGGGAA CGCCCGGCGG CAAGGCCACG ACGCGGTTCA ACGGGCTGGA ATCCCCGCCG
GGCCTGAGGT CCGACCAGCC GGGCAACACT TACACGCTCA TCAGCCCGTC CCTGGGCTCG
ATCGGGCAGG GCGCGCCGGT CTTCTTCCGC GATATCGATG TGGGCGAGGT GCTGGGATAC
ACCATGCCGC CGGGCGGCGT GGGACCGATC CTGATCCAGG TCTTCATCCG CGCGCCCTAT
GACAGCTACC TGCGGACCGA TACGCGCTTC TGGAACGTGT CGGGCGTGCA GGTCGGCTTC
GGGGCCGGCG GCCTGAAGGT CAAGCTGCAA TCGATCCAGG CCCTGTTCTC GGGCGGCGTC
GCGTTCGGCC TGGCGCCGCA GCGGGTCGAC CAGCCGGTGC CCTCGGCGCC CCGGAATTCG
GTCTTCCGTC TCTATGAAAG CCAGGAAGCG GCGGACAATG CCGGCTATCG CGAACGGCTG
TCCCTGGCGA CCTACCTGAC CAATTCGGTG TCCGGCCTGG CGGTCGGGGC GCAGGTCACG
ATGTTCGGCA TCCAGGTCGG CACCGTGACC AGCGTGAAGC TGGACCTGGA CCAGAAGGCC
GGGACGGCCC GGGTGCGGGT GGGCATGGAA ATCCAGCCCG AACGGATTCT GCCGACCGAC
CAGATCCATC ACGACACGAT GGCCGCCACC GTGCAGGCGC TGGTCGATAA CGGGCTGCGG
GCCTCGGTCG ATACGGCCAG CCTGCTGACC GGCGAATCGG TGATCGGCCT GAATTTCGTC
AAGAACGCGA CCCCGGCCAT GGTGCAGGCC GAGGGCACGA CCCTGATCAT CCCCAACAAG
GCGGGCGGGA TCAGCGGCAT CATGGATTCG CTGTCCACTG TCGCGGACAA GATCGCCGCG
ATGCCGCTGA CCCAGGTCGG CGTGAACCTG AACAACCTGC TGGCGCATTC CGACGCACGG
ATCAACAGCC CCGAGGTGCG CCAGGCGATC GTGGCGCTGC GCGATTCGCT GCACAGCATC
CAGGGCCTGG CCGGCGATGC GCGCAGCGGA ATGCATCCGC TGTTCCAGCG CCTGCCGCAG
ATGAGCAAGC AGTTGGACGG CACGCTGAAG AACGCGAACG TGCTGATGGC CAGCTATGGC
GGCGACACGG ACTTCCATCG GGACCTGCAG CAGATGGTGG TGCAGTTGAA CGAGGCGGCG
CGGTCGCTGC GCTTCCTGAC CGATTTCCTC AATCGCCATC CTTCGGCGCT GATTACGGGA
CGCTAG
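
A GC content figure such as the one in the gene record is the fraction of G and C bases in the sequence. The following is a minimal sketch of that calculation for a sequence printed in space-separated blocks of ten, as it is here. The function name `gc_content` is hypothetical, and the example input is only the first line of the gene sequence, not the whole gene.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_bases = sum(1 for base in seq if base in "GC")
    return gc_bases / len(seq)


# First line of the gene sequence (60 bases), used as a small example.
first_line = (
    "GTGACAGACG ACCCCCAAGA TTGTCCCGGT "
    "GGCCGGTCTT CCGCCCCGCC CGAGGCTTCG"
)
print(f"{gc_content(first_line):.2f}")
```

This 60-base excerpt contains 42 G or C bases, so the function returns 0.70 for it. The 67% value in the record refers to the full 1686 bp gene.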
 
Protein sequence
MTDDPQDCPG GRSSAPPEAS ARKYRFSIVW LVPIVALGIA GYLGWRGFMG RGPEITITFD 
TADGLTSGQT QVKNKAVPLG TVQDVALTPD MRHVEVRVRM SAKSDPMLTD HARFWVVRPR
LNGASVTGLE TLMTGAYIAM DPGTPGGKAT TRFNGLESPP GLRSDQPGNT YTLISPSLGS
IGQGAPVFFR DIDVGEVLGY TMPPGGVGPI LIQVFIRAPY DSYLRTDTRF WNVSGVQVGF
GAGGLKVKLQ SIQALFSGGV AFGLAPQRVD QPVPSAPRNS VFRLYESQEA ADNAGYRERL
SLATYLTNSV SGLAVGAQVT MFGIQVGTVT SVKLDLDQKA GTARVRVGME IQPERILPTD
QIHHDTMAAT VQALVDNGLR ASVDTASLLT GESVIGLNFV KNATPAMVQA EGTTLIIPNK
AGGISGIMDS LSTVADKIAA MPLTQVGVNL NNLLAHSDAR INSPEVRQAI VALRDSLHSI
QGLAGDARSG MHPLFQRLPQ MSKQLDGTLK NANVLMASYG GDTDFHRDLQ QMVVQLNEAA
RSLRFLTDFL NRHPSALITG R
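
The protein sequence follows from the gene sequence through translation table 11, the bacterial, archaeal and plant plastid code. The sketch below is a plain-Python illustration, not the method used to produce this record. It assumes that the amino acid assignments of table 11 match the standard code, and that an alternative start codon such as GTG is read as methionine when it occupies the first position. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of the genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: start codons recognised under translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for index in range(0, len(seq) - 2, 3):
        codon = seq[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        # An initiating start codon is read as methionine.
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


print(translate_cds("GTGACAGACG ACCCCCAAGA TTGTCCCGGT"))
```

Applied to the first thirty bases of the gene sequence, this yields MTDDPQDCPG, which matches the first block of the protein sequence. The initial GTG would otherwise be read as valine.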