Gene Ava_1406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1406 
Symbol 
ID3682700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1734403 
End bp1735749 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content45% 
IMG OID637716743 
ProductL-sorbosone dehydrogenase 
Protein accessionYP_321924 
Protein GI75907628 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.685553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTTT CTGGGCTTTT TGTGCTGTCT GTGTTTCTAT TCATCACCGC AGCAGCTTGT 
AACCAGACTA GCGCCTCATT AGATAATTCC CCACCACAAG AAGCATCCAC GTCTGCGGCT
CAATTGGCAC AAAATCCGAC AGAAGGGAAA AACCTTATCC CTACAGAAAC ATTTTCACCT
ACACCTATCC GCATTGACTT ACAGAATTTA CCAGCACCCT TTGCTACAGA TAGTGCATCT
AAAGCCCCGC AAGTTGTACC GATTCCCCAA AAACCAGTTT TGAAAGTCCC CGCCGGGTTT
ACAGTTAACG TTTTTGCTGA AGGTCTAGAT GCACCGCGTT GGCTAGCTTT AACTCCCAGT
GGTGATGTAT TGGTGACGGA AACTAGACAA AACCGGATTC GTTTATTGCG TGATACTAAC
AGCGATGGGG TAGCTGATGT CCGCCAGACT TTTGCTAGCA AAGACAATGG TTTGAATATT
CCTTTTGGGA TGGCGTTTGC TGGTAATGCC TTCTTTTTAG GTAATACTGA TGAGGTTTTA
CAATTTCCCT ACACAGAAGG ACAACAACAA CTCACTGGTA CTGGTAAAAA AATTGCTGAC
CTCCCCGGTG GTGGTTACAA TCAACACTGG ACTCGCAATG TTGTGGCATC ACCCGACGGT
AATAAACTAT ACGTTTCCGT TGGTTCTCGT TCCAATGTGG ATGAAGAAGA ACTACCAAGG
GCTTCTGTAC AGGTAATGAG TTTGGATGGT TCCCAAAAGC AGACTTTTGC CTTTGGCTTA
CGTAACCCTG TTGGTCTTGA CTTTCATCCT GTGACTAGAG AACTTTATAC TACCGTGAAC
GAGAGGGATG GGATTGGTGA TGATTTAGTT CCAGACTACC TGACACGGAT TCGCCAGGAT
GAATTTTATG GTTGGCCTTA TGCCTACTTT ACACCTAAAA ATCTTGACCC CCGGCAAAAG
ACTGGCGGTC AAAGTAAGCG TCCAGATTTA GCAGCGCGTA CTCGTACACC AGATATATTA
TTTCAAGCTC ACTCAGCCGC TTTGGGTTTG CAATTTTATG ATGGTAAAAC TTTTCCTCAA
AGATATCGTA ACGGTGCTTT TGTAGCCTTT CGGGGTTCTT GGAATCGCGA TCGCGGTACT
GGATATAAAG TAGTATTTGT TCCCTTTAAT AGCAAAGGAA GACCGCAAGG TTACTATGAA
GATTTTCTCA CGGGATTTAT GCTAGACCCT GATGTACCAA CCACTTGGGG GCGACCTGTG
GGCTTACTTG TATTACCTGA TGGCAGTCTA TTAGTCACAG AAGAAGCAAA CGATCGCATT
TACCGAATTC AGTATACGGG GGATTAG
 
Protein sequence
MKVSGLFVLS VFLFITAAAC NQTSASLDNS PPQEASTSAA QLAQNPTEGK NLIPTETFSP 
TPIRIDLQNL PAPFATDSAS KAPQVVPIPQ KPVLKVPAGF TVNVFAEGLD APRWLALTPS
GDVLVTETRQ NRIRLLRDTN SDGVADVRQT FASKDNGLNI PFGMAFAGNA FFLGNTDEVL
QFPYTEGQQQ LTGTGKKIAD LPGGGYNQHW TRNVVASPDG NKLYVSVGSR SNVDEEELPR
ASVQVMSLDG SQKQTFAFGL RNPVGLDFHP VTRELYTTVN ERDGIGDDLV PDYLTRIRQD
EFYGWPYAYF TPKNLDPRQK TGGQSKRPDL AARTRTPDIL FQAHSAALGL QFYDGKTFPQ
RYRNGAFVAF RGSWNRDRGT GYKVVFVPFN SKGRPQGYYE DFLTGFMLDP DVPTTWGRPV
GLLVLPDGSL LVTEEANDRI YRIQYTGD