Gene Gdia_0034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0034 
Symbol 
ID6973423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp39913 
End bp41481 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content63% 
IMG OID643389567 
ProductSpoVR family protein 
Protein accessionYP_002274451 
Protein GI209542222 
COG category[S] Function unknown 
COG ID[COG2719] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0748204 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0139145 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCAGA TCACGCCCAA AGGCGGTGGG GATGGGGGCG GCGCCCGGCC GGGCGGCCTG 
CTCTATTCCG GCAATGACTG GAACTTCCAG ATCCTCCGCG ATTGCTACGA TGCGATCGCC
GAGATCGCGG ACAAGGAACT GGGGCTGGAA CTCTATGCCA ACCGGATCGA GATCATCACG
TCCGAACAGA TGCTGGACGT CTATACCTCG CACGGGATGC CGCTGGGGTA CAAGCACTGG
TCGTTCGGCA AGCGCTTCAT CGGGCATGAA AACGCCTATC GCCGCGGACT GATGGGCCTG
GCCTACGAGG TCGTCATCAA TTCCGATCCC TGCATCAACT ATCTGATGGA GGAAAATTCG
GCGACGATGC AGGCGCTGGT CATCGCCCAC GCCGCGTTCG GCCATAACCA TTTTTTCCGC
AACAACCGGC TGTTCCGCGA ATGGACGGAC CCGTCGGAGA TTTTGGACTA CCTGGAATTC
GCCCGCGGCT TCATCGCGCG GTGCGAGGAA CGGCACGGCG TGCGCGCGGT GGAGCGAATC
CTGGATGCCG CCCACGCCCT GCAGAACCAG GGCGTCCACC GTCATTCGGG CGCCCGCAAG
CTGGATCTGA AGGCCGAGCA GCAGCGCGCG CGCGAACGGC GCGCCTATGA AGACAGCATG
TTCAACGATC TGTGGCGCAC CCTGCCCACC GAACCGGCCG GCGAGGAAGG GCAGGCCGAG
GGCGCCCTGG CACGGCGCCT GCTGGGCCTG CCCGAGGAAA ACCTGCTCTA TTTCCTGGAA
AAGAACGCCC CCCGCCTGGC GTCGTGGGAG CGCGAGATCA TCCGGATCGT GCGGATGGTC
GCGCAATATT TCTACCCGCA GCCCCAGGTG AAGATGATGA ACGAGGGCTG CGCCACCTGG
GTGCATTCCT ACATCATGCG CCGGCTGCAT GAACTGGGCC GGATCGACGA CGCGGCGTAT
CTGGAAGTCA TCCATTCCAC ATCGAATGTG ATCAGCCAGC CCGGTTTCGA TGCCGGCGGC
GGACCGTCCT TCAATCCCTA CGCGCTGGGC TATGCGATGA TGACCGACAT CGCCCGGATC
TGCGAGACAC CCACCGAGGA GGACCGGACC TGGTTCCCCG ATATCGCCGG CAACGGCGAC
CCGATCGGCA CCCTGCGGCA TGCCTGGGCG GAATACCGGG ATGAAAGCTT CATCCAGCAA
TTCCTTTCCC CCAAGGTGAT CCGGGATTTC CGCATGTTCC GCCTGCGCGA CGACACCAGC
CAGCCCTACC TGCTGGTCGA CGCGATCCAT GACGAGGCCG GATATCGCGA CATCCGCCGC
AGCGTGGCGC TGACCTACGA TCCCGGGACG TTCTATACCG AAATCGAGAT CGTGGATGTG
GACCTGCTGG GCGACCGCAC CCTGGTGCTG GAACATCGCA GCCGCACCGG CCAGATGCTC
CAGCCCGGCG ATGCGCGGCA GACGCTCGAT TATCTTGCAT TATTATGGGG TTATGGCGTC
ATCCTGAAGG AAATCGACAG CCAGACCGGA ACCGTCGTCA CCACCCATTC GGCCAAACCA
TCCGCATAA
 
Protein sequence
MNQITPKGGG DGGGARPGGL LYSGNDWNFQ ILRDCYDAIA EIADKELGLE LYANRIEIIT 
SEQMLDVYTS HGMPLGYKHW SFGKRFIGHE NAYRRGLMGL AYEVVINSDP CINYLMEENS
ATMQALVIAH AAFGHNHFFR NNRLFREWTD PSEILDYLEF ARGFIARCEE RHGVRAVERI
LDAAHALQNQ GVHRHSGARK LDLKAEQQRA RERRAYEDSM FNDLWRTLPT EPAGEEGQAE
GALARRLLGL PEENLLYFLE KNAPRLASWE REIIRIVRMV AQYFYPQPQV KMMNEGCATW
VHSYIMRRLH ELGRIDDAAY LEVIHSTSNV ISQPGFDAGG GPSFNPYALG YAMMTDIARI
CETPTEEDRT WFPDIAGNGD PIGTLRHAWA EYRDESFIQQ FLSPKVIRDF RMFRLRDDTS
QPYLLVDAIH DEAGYRDIRR SVALTYDPGT FYTEIEIVDV DLLGDRTLVL EHRSRTGQML
QPGDARQTLD YLALLWGYGV ILKEIDSQTG TVVTTHSAKP SA