Gene Gdia_3012 details

Gene Information

Locus tag: Gdia_3012
Symbol:
ID: 6976446
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3297799
End bp: 3298869
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 77%
IMG OID: 643392520
Product: protein of unknown function DUF58
Protein accession: YP_002277357
Protein GI: 209545128
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
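
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3297799
end_bp = 3298869

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
assert gene_length % 3 == 0

codons = gene_length // 3
# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, protein_length)  # 1071 356
```

Under these assumptions the result matches the listed Gene Length (1071 bp) and Protein Length (356 aa).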


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0180166
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.251774
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGCGAT CCTCTCCCTC CCTCCCTGAC CACGGGCCTG ACGACCGGCC GGGCCGCGCG 
GCCCGGCTGC TGCGCCGCCT GCTGCGCCGG CCGCCCCCGA ACGGCCGCGC CACGGCGGGC
GGGGACGCGG CCCTCGATAC GCCCGGCGCG GCCGTTCCCC TGCCCCTGGC GGCGGAAACG
CTGGCCGCCC GCATGCCGGC CCTGATCCTC GCGGCCCAGC GCATCGCCGC GACCGTGGCG
GTGGGCCACC ATGGCAGGCG GCAATCCGGC CCGGGCGAGG ATTTCTGGCA GTTCCGCCCC
GCCCAGCCCG GCGAGCCGGT GACCCGGATC GACTGGCGGC AATCCGCGCG CAGCCTCCGC
GCCTATGTGC GCGAGACCGA GGCCGAGGCC GCCCAGACGC TGTGCCTGTG GTGCGACCCC
AGCGCGTCGA TGCGCTGGCG CTCGGGCGCG GCGCTGCCGC TGAAATCGGA CCGCGCGGTG
CTGCTGGCCC TGGCGGTGGG CACGCTGGCG CTGCGCCAGG GGGAACGGGT GCGGGTGCTG
GCCCCGGACG GCCCCATCGA CATCCCCCCC GGCGGACGGG CGGCCCTGGA CCGGCTGGCC
GTGGCGCTGC TGCGGATCAT GGAGGGCGGA CCGGACAATC CCGGCCTGCC CAATCCGCAC
CAGGTTCCCC GCCATGCAAG GGTCGTGCTG CTGGGCGACG GGCTGGGCGA GATCGCGCCG
CTCGACGCCC TGCTGCGCGG CCTGGCCGCG CGCCCGGCGC GGGCGCACCT GCTGCTGGTC
AACGACCCGG CGGAGGCCAG CCTGCCCTAT GCCGGGCGCG TCCGCTTCGC GGGGCTGGAG
GACGAGGCCG CGATGACCCT GTCGGGCGTC GAAGGCCTGC GCGCCGCCTA TCGCGATGCC
TATGCCCGCC ATCAGGACGA TCTGGCATCC GTGTGCCGCG CCACCGGCCT GGACCTGATC
CGCCATGTCA CCGACCAGCG GCCGGAAACG GCGCTGCTGG CCCTGCACGC CGCCCTGATG
GATCGGGGCG GCGCGGCCGG ACGGGCAGCG CGGGGGGGGC GCGGCCGATG A
 
Protein sequence
MTRSSPSLPD HGPDDRPGRA ARLLRRLLRR PPPNGRATAG GDAALDTPGA AVPLPLAAET 
LAARMPALIL AAQRIAATVA VGHHGRRQSG PGEDFWQFRP AQPGEPVTRI DWRQSARSLR
AYVRETEAEA AQTLCLWCDP SASMRWRSGA ALPLKSDRAV LLALAVGTLA LRQGERVRVL
APDGPIDIPP GGRAALDRLA VALLRIMEGG PDNPGLPNPH QVPRHARVVL LGDGLGEIAP
LDALLRGLAA RPARAHLLLV NDPAEASLPY AGRVRFAGLE DEAAMTLSGV EGLRAAYRDA
YARHQDDLAS VCRATGLDLI RHVTDQRPET ALLALHAALM DRGGAAGRAA RGGRGR
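
Both sequences above are printed in groups of 10 characters, 60 per line, so the spacing has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Drop spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record above.
first_line = "GTGACGCGAT CCTCTCCCTC CCTCCCTGAC CACGGGCCTG ACGACCGGCC GGGCCGCGCG"
last_line = "GATCGGGGCG GCGCGGCCGG ACGGGCAGCG CGGGGGGGGC GCGGCCGATG A"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

# The gene opens with GTG, an alternative start codon under translation
# table 11, and closes with the stop codon TGA.
print(head[:3], tail[-3:])
print(f"GC fraction of first line: {gc_fraction(head):.2f}")
```

Passing the full gene sequence block through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content listed in the record.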