Gene Gdia_2949 details

Gene Information

Locus tag: Gdia_2949
Symbol:
ID: 6976383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3227999
End bp: 3229282
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 71%
IMG OID: 643392458
Product: protein of unknown function DUF195
Protein accession: YP_002277295
Protein GI: 209545066
COG category: [S] Function unknown
COG ID: [COG1322] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
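
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 3227999
END_BP = 3229282


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1284
print(protein_length(length_bp))  # 427
```

Both results agree with the Gene Length (1284 bp) and Protein Length (427 aa) fields in the record.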


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.358116
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.173006
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGACG CCCTGGGTCC CGGCAGCATC CTGGCCATCG TGGCGCTGGG GGCGGTGGTG 
GTGCTCGCCG TCCTGCTGCT GCGGCGCGGC ACGGGGGCGG GGGCGGCCGG CGAGGGCGAC
CTGCTGGCGC GGCTGTATGT CATGCTGGAA CGCGAAACCG CCGCGCGCAC GGCCGACAGC
GAGGCCCAGC GGGCCCGGCT GGCGGAAATC GAGCGCGTAC TGGCCGCGCG GCTGGACCAG
TCCCGGGCCG AGACCGCCGA CCGGCTGGCC ACGATCGCGC AGTCCATCAC CCGCGACCTG
GGCGATGCGC GCGTCCGCCA GGGCGAGGCA CTGCGCGAGA TGGCCGAAGC CTCCGCCCGG
CAACTGGAGA CGATCCGCAC CGCCGTCAAC GAACGGCTGC ACGAGGCCGT GGAACGGCAG
ATGCAGACCT CGTTCCAGCG CGTGCTGGAA CAGTTCGCCG CGATGCAGAA GGCGATGGGC
GAGGTCACGG CCATGACGGC GCAGATCGGG GACCTGAAGC GCCTGTTTTC CAACGTCAAG
ACCCGGGGCG GCTGGGGCGA GGCGCAGTTG CGCGCCATCC TGGACGACGT GCTGCCGGCC
GGCGCCTACC AGGCCAATTG CCGCCTGCGC GAAGGCAGCG CGGAGGTCGT GGAATTCGCC
GTGCGCATGC CGGTCCGCGC CACGACGCCG CCGGTGCTGG CGATCGATTC CAAATTCCCG
ACCGAGGCCT ATGAACGGCT GCTGGACGCG GTGAACCGCG TGGACGCCGA GGCCGAGCGC
GCCGCCCGCC GCGCGCTGGA AACCACCCTG CGGATCGAGG CGCGCAAGAT CGCGTCCAAA
TATATCGTCC CGCCGGTGAC GGTGGAATTC GCGGTGCTGT ACCTGCCGAC CGACGGGCTG
TATGCCGAGG TCGCGCGCCT CCCCGGCCTG CTGGACGAAA TCGGGCGCAC CTGCCGGGTG
ATCGTCATGG GCCCCGGCCT GCTGCCGGCC ATGCTGCGGA CGATTCACCT GGGCTACGTC
ACCCTGGCGC TGGAGGAGCG GACCGACGGC ATCGCCCGCC TGCTGGGCGC CACCCGGCAG
GAAATGCTCA AGATGGACGG GGTGCTGGAA CGCCTGGCCC GCAACGCCTC GGCCATGTCA
TCCTCGATCG ACGAGGCCAG GCGGCGCACG CGGGTGGTGG CGCGGCAGCT GCGCGGCCTG
GACGGCGTCG AGTCGCTGGT TCCCGAAGGC GCGGGGGATG ACGCGGCGGC CGGGACCGGT
GAAACCGAAT TTAATGGTGC ATGA
 
Protein sequence
MSDALGPGSI LAIVALGAVV VLAVLLLRRG TGAGAAGEGD LLARLYVMLE RETAARTADS 
EAQRARLAEI ERVLAARLDQ SRAETADRLA TIAQSITRDL GDARVRQGEA LREMAEASAR
QLETIRTAVN ERLHEAVERQ MQTSFQRVLE QFAAMQKAMG EVTAMTAQIG DLKRLFSNVK
TRGGWGEAQL RAILDDVLPA GAYQANCRLR EGSAEVVEFA VRMPVRATTP PVLAIDSKFP
TEAYERLLDA VNRVDAEAER AARRALETTL RIEARKIASK YIVPPVTVEF AVLYLPTDGL
YAEVARLPGL LDEIGRTCRV IVMGPGLLPA MLRTIHLGYV TLALEERTDG IARLLGATRQ
EMLKMDGVLE RLARNASAMS SSIDEARRRT RVVARQLRGL DGVESLVPEG AGDDAAAGTG
ETEFNGA
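
The gene and protein sequences above can be related to each other in code. The following is a minimal sketch, not the database's own pipeline: it strips the 10-base grouping, computes GC fraction, and translates codons using the standard codon assignments, which translation table 11 shares. The helper names `clean`, `gc_content`, and `translate` are hypothetical. Alternative start codons permitted by table 11 are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI base order (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases and the final line of the gene sequence above.
print(translate("ATGTCCGACG CCCTGGGTCC C"))     # MSDALGP
print(translate("GAAACCGAAT TTAATGGTGC ATGA"))  # ETEFNGA
```

The two fragments translate to the first and last seven residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give the value to compare against the GC content field.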