Gene Ava_4386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4386 
SymbolaroB 
ID3680572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5497082 
End bp5498173 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content44% 
IMG OID637719739 
Product3-dehydroquinate synthase 
Protein accessionYP_324879 
Protein GI75910583 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00709401 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTTCTG TAATTAATGT GAATCTACCA ACGCAGTCTT ATGAGATTGC GATCGCACCT 
GCAAGTTTAG ATCAGATTGG TCAAAGCTTG GCTGGGTTAA AACTGGGCAA GAAAGTATTA
CTGGTTTCTA ATCCCACGAT TTTTAAGCAT TTTGGCAAAG TTGCGGTTGA TTCCTTAGAA
GCTGCTGGAT TTCAAGTAGC AAGTTATTGC TTACCAGCAG GGGAACGCTA CAAAACCCTT
AATTCTATTC AAAAACTCTA CGATATAGCC CTAGAAAATC GCCTAGAACG ATCCTCAACA
ATGGTGGCTT TGGGGGGAGG GGTAATTGGT GATATGACTG GCTTTGCCGC CGCTACTTGG
CTACGGGGGA TTAATGTAGT GCAAGTACCC ACCACACTCT TAGCAATGGT AGATTCGGCT
ATTGGTGGTA AGACAGGTGT TAATCATCCC CACGGGAAAA ACTTGATTGG TGCGTTCCAT
CAGCCGCGAT TTGTGTTAAT TGATCCCCAA GTACTAAAAA CCTTGCCTGT ACGAGAATTT
CGCGCGGGAA TGGCAGAGGT AATTAAGTAT GGCGTGATTT GGGATGCAGA ATTATTCAAC
CAGCTAGAAC AAAGTAAACG TCTCGACCAA CTGCGCTACA TCAAGCCAGA ATTGATGGAT
GCTATCTTAA CTCGTTCATG TCAAGCCAAA GCTGATGTTG TCGGCAAAGA TGAGAAGGAA
GGTGGACTGC GTGCGATTTT GAATTACGGA CACACCGTTG GTCACGCGGT GGAAAGCTTA
ACTAACTATC GGCTACTCAA ACATGGTGAA GCAGTAGGTA TCGGCATGGT AGCGGCTGGG
CAAATTGCTG TAAATTTAGG ACTGTGGCAA CAAGCAGATG CAGACCGTCA AAATGCCTTA
ATTGAAAAGG CGGGTTTACC GACAAAGTTA CCAGCCGGAT TAGATATTGA AGGGATTATT
GAGGCATTGC AATTAGATAA AAAAGTCAAA GATGGTAAAG TACGGTTTGT TTTACCAACT
CAAATTGGTG TAGTGACAGT TACTGACGAG GTGACATCAG ATCACATTCG GCAAGTTTTA
CAGCAGATGT AA
 
Protein sequence
MTSVINVNLP TQSYEIAIAP ASLDQIGQSL AGLKLGKKVL LVSNPTIFKH FGKVAVDSLE 
AAGFQVASYC LPAGERYKTL NSIQKLYDIA LENRLERSST MVALGGGVIG DMTGFAAATW
LRGINVVQVP TTLLAMVDSA IGGKTGVNHP HGKNLIGAFH QPRFVLIDPQ VLKTLPVREF
RAGMAEVIKY GVIWDAELFN QLEQSKRLDQ LRYIKPELMD AILTRSCQAK ADVVGKDEKE
GGLRAILNYG HTVGHAVESL TNYRLLKHGE AVGIGMVAAG QIAVNLGLWQ QADADRQNAL
IEKAGLPTKL PAGLDIEGII EALQLDKKVK DGKVRFVLPT QIGVVTVTDE VTSDHIRQVL
QQM