Gene Ava_1429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1429 
Symbol 
ID3682568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1766415 
End bp1768343 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content43% 
IMG OID637716766 
Productglycoside hydrolase, starch-binding 
Protein accessionYP_321947 
Protein GI75907651 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.56449 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACAAA CACCACCATC ACAATTTTCC GAAGAGCAAT ACAAAGTCGA AAATGCTAAT 
GCAGAGGTAC AAGCATTAAT CGAATCGCCT ACGCCAGATA CAGAAATAGA TTTAGAATTT
CTCTACACTA GAGATATTGA ATTTCGCCAA GAAACTATTT ACTTCCTCGT TGTAGATCGC
TTTTATGATG GCGATGCTGA AAATAGCGAA GGTTATAATC CAGAACTTTA TGACCCCAAT
GGGAAAGATT GGGGCAAGTA TTGGGGGGGT GACTTACAAG GAGTAATCGA CAAATTAGAC
TACTTGAAAG ATATGGGAGT AACCGCACTT TGGCTAACTC CTTTATTTGA ACAAGTCGAG
GAATTATTTG TTGGTAACGC AGCCATGCAC GGCTATTGGA CAAAAGATTT TAAACGGCTA
AATCCTCGCT ACATTGGTAA TGGAGAAGAT CCTTCCTTAA ACAATACTCA AGAAACCAGA
AATACGACTT TTGACCGCTT AATTGCAGAA TTACACAAGC GGAAAATGAA GCTGATACTA
GATATTGTCT GTAACCATAG CAGTCCTGAT ACTAGTGGTA GCAAAGGTGA GTTATATGAT
GACGGCGTAA AAATTGCCGA CTTTAATGAT GATGTGAATC ACTGGTATCA CCACTATGGT
GAAGTGCAGA ACTGGGAAGA TGATTGGCAA GTCCAAAACT GTGAACTGGC TGGTTTAGCT
ACTTTTAATG AAAATAATAC TGAGTATCGT AACTATATCA AGTCTGCAAT TAAGCAATGG
CTAGACCGGG GTGTGGATGC GCTGCGGGTA GATACAGTCA AACATATGCC AATTTGGTTT
TGGCAAGAAT TTACTGGTGA TATGTATAAT CACAAACCAG ATGTATTTAT TTTTGGTGAG
TGGATTTACA ATTATCCCAG TGACGATCGC TCGGTGGAAT TTGCCAATAA TTCCGGTATG
ACTTTACTTG ATTTTGGTCT GTGCGTAGCA ATTCGCGGCG CATTAGCCCA AGGTGCGGAA
GGGGGATTCC ATCTCATCCA AGAAATATTC GACCAAGATG ATCGCTACAA CGGGGCTACG
GAGTTAATCA CCTTTGTTGA TAACCATGAT ATGCCCCGCT TTCAATCCCT CAACCCCGAT
CCAGCGATGT TGAAAGTAGC GATCGCTCTC ATTATGACAT CACGGGGTAT TCCATGTATC
TATTACGGTA CAGAACAATA TCTGCACGAT GATACCAACG GCGGTAATGA CCCCTATAAC
CGCCCCATGA TGGAAAATTG GGATACTGAT ACTGAGGTTT ATAGATACCT CCGATTGTTG
TCTGGTATCC GACGGTTAAA TCCAGCCGTC TCTATGGGTA GCCAGTGGCA AAAATACCTC
ACACCCGATG TTTATTGTTA TGTCCGCCGT TATCGTTCTT CTGTTTGCTT CGTCGCCCTT
AATCGTGGTG GAGAGGTGAC TTTACCAGAA GTCCAAACAG ACTTACCAGA TGGTGAACAT
ACTTGTGCGG TGACTCGGAA TAAATTTGAG GTAAAAGACG GTAAAATTTA CAATCTACAA
CTGGAAGAAC GGGGAGTCAT CGTTTTAAGT CATGTAGGCG AACGAGTAAA AGCGCAAACC
ATTATCCGCG TTCAACTTAA TGGTGTACAT ACTCAACCAG GGGAAACTAT CGTAGTTGTC
GGCGACTGTC CAGAATTGGG TAACTGGGAT ATTAGTAAAG CCTATCCCCT GGAATATATC
AACTCTAATA CTTGGTTTGC CGAAATTCCC TTTGATGAAA GTGCAGGCAA ACTCATTAGT
TACAAATATG CCATGTGGCG TGAAGGGCGA TCGCCTCTGC GAGAAAATAC CTTAAATCGT
CGCTGGGTAG TGGCGAAGGA AGGCACTGTT AAATGGCGTG ATACTTGGGC TTCTGGGAGA
GAGTCTTAA
 
Protein sequence
MVQTPPSQFS EEQYKVENAN AEVQALIESP TPDTEIDLEF LYTRDIEFRQ ETIYFLVVDR 
FYDGDAENSE GYNPELYDPN GKDWGKYWGG DLQGVIDKLD YLKDMGVTAL WLTPLFEQVE
ELFVGNAAMH GYWTKDFKRL NPRYIGNGED PSLNNTQETR NTTFDRLIAE LHKRKMKLIL
DIVCNHSSPD TSGSKGELYD DGVKIADFND DVNHWYHHYG EVQNWEDDWQ VQNCELAGLA
TFNENNTEYR NYIKSAIKQW LDRGVDALRV DTVKHMPIWF WQEFTGDMYN HKPDVFIFGE
WIYNYPSDDR SVEFANNSGM TLLDFGLCVA IRGALAQGAE GGFHLIQEIF DQDDRYNGAT
ELITFVDNHD MPRFQSLNPD PAMLKVAIAL IMTSRGIPCI YYGTEQYLHD DTNGGNDPYN
RPMMENWDTD TEVYRYLRLL SGIRRLNPAV SMGSQWQKYL TPDVYCYVRR YRSSVCFVAL
NRGGEVTLPE VQTDLPDGEH TCAVTRNKFE VKDGKIYNLQ LEERGVIVLS HVGERVKAQT
IIRVQLNGVH TQPGETIVVV GDCPELGNWD ISKAYPLEYI NSNTWFAEIP FDESAGKLIS
YKYAMWREGR SPLRENTLNR RWVVAKEGTV KWRDTWASGR ES