Gene Gdia_1304 details

Gene Information

Locus tag: Gdia_1304
Symbol:
ID: 6974709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1456075
End bp: 1457340
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 61%
IMG OID: 643390833
Product: hypothetical protein
Protein accession: YP_002275701
Protein GI: 209543472
COG category:
COG ID:
TIGRFAM ID:
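
The coordinates and lengths in the table above are consistent with one another, and the arithmetic can be sketched as below. This is a minimal illustration, not part of the original record. It assumes 1-based inclusive coordinates and a single unspliced open reading frame whose last codon is a stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced here.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Residue count, assuming an unspliced ORF ending in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length // 3 - 1  # the stop codon encodes no residue


# Values copied from the Gene Information table.
length = gene_length_bp(1456075, 1457340)
print(length)                     # 1266
print(protein_length_aa(length))  # 421
```

Both results match the Gene Length and Protein Length fields listed in the table.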


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTGA TATCTCATGT CCTTACGATT ATCGTGTGTT CCGTCGGCGT GGCCGTGGCC 
CTGAAACCCG TTGCCGCGCG CGCCCAGGTG ACCATCTTCC AGAAATCCGG CCCCTCATCC
GATTTCGATG CCTGGCTGCA GGGCATAACC CTGACCGGGC AGATCGAGGG CGGCATCGAC
GCCAATCCCG CCCGGCCCGA CAACGGCATC AATTTCGGCA ACTTCCTGGG GGACCACGCC
AATCAGGTGC AGCTCAACCA GGTGGCGCTG ACCCTTGCCA GGGCCATCGA CCCGACGAAG
GCCGAATACC AGATCGGCTT CACGCTCGAA GCGCTCTACG GCTCGGACGC ACGCTATTAC
CACCTGCTGG GCATTTCCGA CCACATGACG TCGGACCGCT ATCAGCTCAT TCCCGCCCAG
GCCCACGTCG ATACCCACCT GCCATGGCTG ACGAAGGGGG GGCTGAACAT GCAGGCGGGC
ATCCTGCAGG CCCCCATGGG GGTCGAAACC CTGGACCCGA CAACGCGGCC CTTCTATTCT
CTGGCCTATA CGTCGGAATA TTCGGTGCCG TTCCAGCATG TCGGCGCGAT GTTCAAATGG
CACGTGATCG ATATGCTCGA CGTCACCTTC GGCATCGATA CCGGCAACCA GACGACGTTC
GGCCGCAGCG ACAACAATGA CGCACCGGCC GGCTATTTCG GCTTCAACCT GAACAACCTG
GCACACGGCA AACTGACCAT CATCGAACTC AGCCGTGTCG GACCCGAAGA TTCGGTGAAG
GTCCTGGGCT CGCCCGCCAA TCACCTGAAT CGATTCTGGA ACGATATCAA CGCGACCTAT
GCCATCACGG ACAAGCTGTC GGTCACCGGC GAATTCAACT ACCTGCACGA TGACGGGCTG
CGGGCGGATA CGACCAGCTT CGTCAGCTTC CTCAGCTACA AGATCACGCC GACCCTGACC
TTCAATTATC GCGGCGAAAT CTATCGCGAC AATACCGGCC TGTTCGTCGC CAGCTTCCTG
ACCAACCGGG CCTATATGCA GGCCGTCGCC GGCATTCCCG CCCCCGCGGA ATCCGCCCCG
CCGACCACCT ATGGCGAACT GACGCTGGGC GTCACCTACA AGCCGGATCT GGGCCACCAT
ATCCGGGTGT TCGAGATCCG GCCCGAAATC CGCTTCGACC GGTCGCTGAA CGGCACGACG
CCCTTCAACG ACGGACGGAA CACGGGCGTG TTCACGTTCG GCGGCGACGC CGTGCTGGGT
TTCTGA
 
Protein sequence
MKLISHVLTI IVCSVGVAVA LKPVAARAQV TIFQKSGPSS DFDAWLQGIT LTGQIEGGID 
ANPARPDNGI NFGNFLGDHA NQVQLNQVAL TLARAIDPTK AEYQIGFTLE ALYGSDARYY
HLLGISDHMT SDRYQLIPAQ AHVDTHLPWL TKGGLNMQAG ILQAPMGVET LDPTTRPFYS
LAYTSEYSVP FQHVGAMFKW HVIDMLDVTF GIDTGNQTTF GRSDNNDAPA GYFGFNLNNL
AHGKLTIIEL SRVGPEDSVK VLGSPANHLN RFWNDINATY AITDKLSVTG EFNYLHDDGL
RADTTSFVSF LSYKITPTLT FNYRGEIYRD NTGLFVASFL TNRAYMQAVA GIPAPAESAP
PTTYGELTLG VTYKPDLGHH IRVFEIRPEI RFDRSLNGTT PFNDGRNTGV FTFGGDAVLG
F
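
The sequences above are listed in blocks of ten characters, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction like the one in the GC content field. It is an illustration only. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C in an ungapped DNA string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten).
first_line = "ATGAAGCTGA TATCTCATGT CCTTACGATT ATCGTGTGTT CCGTCGGCGT GGCCGTGGCC"
seq = clean_sequence(first_line)

print(len(seq))   # 60
print(seq[:3])    # ATG, the start codon
print(gc_fraction("GGCCAATT"))  # 0.5 on a toy input
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare with the GC content field, which the record reports rounded to a whole percent.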