Gene Gdia_0625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0625 
Symbol 
ID6974022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp700643 
End bp702436 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content70% 
IMG OID643390156 
ProductHeparinase II/III family protein 
Protein accessionYP_002275032 
Protein GI209542803 
COG category[S] Function unknown 
COG ID[COG5360] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.27993 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCTGC GACGCTGGAG CCAGGACGCG CGTCTTTCAC TGGCACGCCT GCCGTCGCTG 
GCCGGGTTGG GCCGCGTCCC GCCCCAGCCG GTCCATGCCG TTCGCGACCT GTGGCCGGGC
GATGCGGCAT CGGGCGCGCG GCTGCTCCGC GGCAGCCTGT CTCATGCCGG CGTCACGCGG
CCGATCGGCC CGGGCCGGTG GGAGGACCCG TCCTACCCCG AGCGCTTCCG GGCCTGCCTG
CACGGATTCG CCTGGCTGCG CGACCTGCGC GCGGTCGGGA CCGATTCGGC GCGGCTGCAG
GCGCGGGCGC TGGTCGATGA CTGGCTGTCC CATCCGCCCA GCGACCCGAT GGTGCGTGAC
TGCGCCGTGA CGGGCACGCG GCTGGCGTCG TGGCTGGGCC ATTACGATTT CTTCGCCGCT
TCCGCCGATG ACGGGTTTCG CCAGCGCCTG ATGCAGCGCC TGCTGGCGGA AGGACGGACG
ATTGCCGCCC TGATGCCGCC CGAATCGCAT GACTGGCGGG CGCTGGCCGC GTTCAAGGGC
CTGCTGGCGG CGGCGATCGC CATGCCCGAC CATAGCGGCT TCCTGGTGCG GTTCCGGCGC
TATATCGACG CCGAACTGGA ACGGCAGATA CTGCCGGACG GCTGCCATAT CCAGCGCAGC
CCGGAAATCC AGTTCCTGGT GCTGCGCGAA CTGGCGGAAA TGCACGCGAT GCTGCACGCC
GCGCAGATCG CCCCGCCCAT GGCCCTGACC CTGGCGCTGG ATCGGATGAG TCCCGTTCTG
CGCGCGATGC GGCATGGCGA CGGCGGGCTG GCGCTGTTCA ACGGCAGCCA CACGGGCAAT
GTCGCGATGA TCGAGACGGT GCTGTCGCAG GCGACGCGCA CGCGCGTGGT GGCCACCGCG
ATGCCGGACG GCGGCTTCAT CCGCCTGCAG GTCGGCCGGT CGCTGCTGCT GGTGGACGCG
GCCCCCCCGC CGCCGCCCGG CTTCGATGAA GATGCCCATG CCGGCACGCT GTCGTTCGAA
TTTTCGGTGG CGCGGCGGCG GGTGATCGTC AATTGCGGCG CGGGCGAGGG GCCGGAATGG
CGACGGGCCC TGCGCGAAAG CGCGGCCCAT TCCCTGCTGG TTCTGGAGGA TACCTCGTCC
TCGGACTTCG CGCCGCAGGG CGGAATCCTG CGCCGGCCGG TCCATGTGAC GGCCGAACAG
GTGGCCCAGG ACGGCGCGCA TTGGCTGGAC CTGTCCCATG ACGGCTATCA CGCGCCGTTC
GGCGCATCCT GGCGGCGGCG CCTCTATCTG GGAAACGGGG GCGAGGACCT GCGGGGCGAG
GAAATCGTCG AGGGCGAGCG CCAGCAATCC TTCGTGCTGC GCTTCCACCT CCATCCGTCG
GTCGGTGCGG AATGGGATGC CGATGCCCAG ATCGTCATCC TGGACGTCGG CGGCCAGATC
TGGAAATTCC GTGCCGACGG CGGCAAGGTC GCGGTCGAGG AAAGCGTCTA TTGCGGCGGA
ACGACCCCCG AGCGCAGCCG GCAGCTCGTC GTCCGGGTGC GTCCCGGCGA CCATGCGGAC
GAAGATCAGG CAGAGGACAA TCAGGCGGAT ACCGATCAGG CGGATAAGGA GCCGGCGGAT
GAAGGCCGGA CAGATGAGGA TCGGGCCGCC CGGGACCATG CGGATGAAGA CGGGCCGGCC
CGGAACGCCG ACACCCCTGA CGCGGCCCCC CCGGCGAAGC CCGTGCCCCA GGCCGAGAGC
GGCGAGCGTA CACGCCAGGT CGTGCGCTGG GCGTTGATGC AGATGGAAGG GTAG
 
Protein sequence
MVLRRWSQDA RLSLARLPSL AGLGRVPPQP VHAVRDLWPG DAASGARLLR GSLSHAGVTR 
PIGPGRWEDP SYPERFRACL HGFAWLRDLR AVGTDSARLQ ARALVDDWLS HPPSDPMVRD
CAVTGTRLAS WLGHYDFFAA SADDGFRQRL MQRLLAEGRT IAALMPPESH DWRALAAFKG
LLAAAIAMPD HSGFLVRFRR YIDAELERQI LPDGCHIQRS PEIQFLVLRE LAEMHAMLHA
AQIAPPMALT LALDRMSPVL RAMRHGDGGL ALFNGSHTGN VAMIETVLSQ ATRTRVVATA
MPDGGFIRLQ VGRSLLLVDA APPPPPGFDE DAHAGTLSFE FSVARRRVIV NCGAGEGPEW
RRALRESAAH SLLVLEDTSS SDFAPQGGIL RRPVHVTAEQ VAQDGAHWLD LSHDGYHAPF
GASWRRRLYL GNGGEDLRGE EIVEGERQQS FVLRFHLHPS VGAEWDADAQ IVILDVGGQI
WKFRADGGKV AVEESVYCGG TTPERSRQLV VRVRPGDHAD EDQAEDNQAD TDQADKEPAD
EGRTDEDRAA RDHADEDGPA RNADTPDAAP PAKPVPQAES GERTRQVVRW ALMQMEG