Gene Ava_1683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1683 
Symbol 
ID3682239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2111262 
End bp2112638 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content47% 
IMG OID637717022 
ProductOpcA 
Protein accessionYP_322200 
Protein GI75907904 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3429] Glucose-6-P dehydrogenase subunit 
TIGRFAM ID[TIGR00534] opcA protein 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCCC AAGCTCCTAC CATTTTCTCA CTCCAAGCCC CGAAGGACAT TTCGCTGAAC 
GAAATCGAAG CGGAACTTAA TCAAATTTGG CAAAGCTACG GCATCACCGG CGAAGATGGC
GCATTACCTG CGGCTACTCG TGCTACTACA TTTACTCTAG TAGTATATGA ACCAGAAGAA
ACCCAATATC TGTTGGCTTC TTTAGGATTC TACAACGGGC CAATTGATGG CATCTTAGGC
CCACAGACAG AAACCGCACT ACGACAAGTA CAAATCAAGT ACGCACTCCC AGAAACCGGC
ACAGCTACAC CGGAAACTCT GGCTAAACTG CGAGAAGAAT TTGTCAAACG CCAAGGCAAT
TCTGCTAATG GTGAGACTAA TGGCAGTACT TCCTATAGTT ACAGCAGCAC CAGCCCCAGA
ATTGCTGATG AAATCGCCCT CCGTAATCCT TGCCGGATTA TTGCCCTGTT TCCCATTGTT
GGCGAAGATG AAGGGGTAAA GGCTCAAGTT TCTGCCTACT GCCCAATTCA AAAACAATCT
TCCAGTACAC TCATCTGCTG TGAGTACATT ACTCTCAGTG GTACACCAGC AGCATTGGAA
AGAATTGGCG GGATGATTCC CGCATTGTTG ATTGGTGGGT TGCCAAAATT CCTCTGGTGG
AAGGCTACAC CAGACCCCAA CAACATTTTA TTTAAACGCT TGGCCGCAGT TTGCAACAAT
GTGATTGTTG ATTCTTGCAA CTTCAACGAG CCAGAAAGCG ATTTACTCAG CCTGCAAAAG
TTAGTAGAAA CAGGCGTACC TCTAGCTGAT TTAAACTGGC GTAGGCTGGC TGCATGGCAA
GAGTTGACAG CTGAAGCTTA CGATTCTCCC GACCGTCGCG CCGCTTTGGG AGACATTGAC
CGGGTGACAA TTGATTACGA AAAAGGTAAC CCAGCCCAAG CATTGTTATT TTTGGGATGG
TTAGCGAGTC GTTTGGAATG GCAACCCATT TCCTATCAAA AGGATAGCGG AGACTATGAT
ATTACTCGCA TTCACTTTGT TAACCAAGAC CAAAAGCGAG TAGAAGCTGA ATTGGCAGGG
GTTCCAGTTG CGGATGTGGG TGATATTGTG GGCGATTTAA TTGCCTTGCG CCTCAGTTCA
ACCAATCCCC AAGCCAATTG CGGTACAGTC ATCTGCTCAG AAACTGGCGG TTGTATGCGG
ATGGAAACCC ACGGTGGCGC TCAAGCCGCA GGTCTATTTC AACAAGTGAG TTCCTTATCG
GAACAAAAGG CAGAAGCTTT ACTCAGTCAA CAGGTACAAC GCTGGGGACG TGAGTCACTG
TTTGAAGAAA GTTTGGCTTT AATTGGGCAA GTATTTCAGT TAGGCATTAA GAATTAA
 
Protein sequence
MTSQAPTIFS LQAPKDISLN EIEAELNQIW QSYGITGEDG ALPAATRATT FTLVVYEPEE 
TQYLLASLGF YNGPIDGILG PQTETALRQV QIKYALPETG TATPETLAKL REEFVKRQGN
SANGETNGST SYSYSSTSPR IADEIALRNP CRIIALFPIV GEDEGVKAQV SAYCPIQKQS
SSTLICCEYI TLSGTPAALE RIGGMIPALL IGGLPKFLWW KATPDPNNIL FKRLAAVCNN
VIVDSCNFNE PESDLLSLQK LVETGVPLAD LNWRRLAAWQ ELTAEAYDSP DRRAALGDID
RVTIDYEKGN PAQALLFLGW LASRLEWQPI SYQKDSGDYD ITRIHFVNQD QKRVEAELAG
VPVADVGDIV GDLIALRLSS TNPQANCGTV ICSETGGCMR METHGGAQAA GLFQQVSSLS
EQKAEALLSQ QVQRWGRESL FEESLALIGQ VFQLGIKN