Gene Ava_1426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1426 
Symbol 
ID3682662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1759819 
End bp1761306 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content45% 
IMG OID637716763 
Productglycoside hydrolase family protein 
Protein accessionYP_321944 
Protein GI75907648 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1626] Neutral trehalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAAA ACACCGTCAG AACCTACATC AAACAAACCT GGAAAACCCT CACCCGCTCC 
CACCAACATT TACTAGAATC CGCCCAAGAC ACCAAACTAG AACACCAACC CAACACCCCT
TGGATTATCT ACATCTCCCC CCAAGAAGAC TGTAATACAG TCAAGTCCGT ATTAGAGCGA
TCGCTCTCGA CAAAAGAGAT GCAGCAAATA GAAATTCGCA CCTTACCCAG CGAAGTTGAA
GCCATCCAAG AACACGGATT GTTATATCTC CCAGGGCCTT ACGTCGTCCC TGGTGGTCGT
TTTAATGAAA TGTATGGTTG GGATAGCTAT TTTATTTTGC TTGGACTGTT GCAAGATGAA
GAATGGGAAC TAGCCCAAAG TCAAGTTGAT CAGCTACTTT ACCAAGTCCA GCATTACGGT
ACGATCCTCA ACGCCAACCG TACCTATATG CTGACGCGAT CGCAACCCCC CGTCCTGAGT
ATGATGGTAT TGGCGTTATT CCAGCACACC CAAGACCAAG CATGGCTCAA ATCAACCCTG
CCATTGCTAG AACAGTTTTA CTATTATTGG GTAGTTCCAC CTCACCTAAA TTCAGCCACG
GGCTTATCTA GATACTACGC CTTAGGCGAA GGGGCTGCAC CAGAAGTATT ATTTTCCGAA
CTGGATGAAG CCGGACGCAG CCACTACGAA CGCGTCAAGG AGTATTACAA AACATTTGAG
ATTGATGACT ATGATGTGAG TTTGTTTTAT GACTCAGAAA AAGACGAACT CACAGACTTA
TTTTACAAAG GCGATCGCTC CATGCGTGAG TCTGGTTTTG ATATCACCAA CCGTTTCGGC
CCCTTTAGCG TTGATATCGT CCATTATGCG CCCGTCTGTT TGAACAGCCT GCTTTATCAA
ATGGAGCAAG ACTTAACGCA AATTCACAAG ATTTTAGATA ATCCAGAACT TGCAGAACAA
TGGAGCGATC GCGCTAATAT TCGCCGTGAG CGTATCAATC AATACCTGTG GGATGAAGAA
AAAGGAATTT ATTTAGACTA TCACTTCTAC AGTGGCAAAC GCCGTCATTA TGAATTTGCC
ACTACCTTCT ATCCCCTGTG GACAGGTCTT TCCTCCCCAG AACAAGCCCA ACGCATTGTA
GAAAATCTTT CCTTATTTAC AGCCCCAGGA GGAATATTCA CCAGCACCCG TGTAACGGGG
AACCAATGGG ACGCACCTTT TGGCTGGGCC CCACTCACCT TAATTGCAGT CCAAGGACTT
TATCGTTACG GATATCGCAA GGAGGGGGAT GATATTGCTC ATAAATTCCT AACTATGGCG
ATTCAAGAAT TTACGAAATA CGGTTTTTTT GTAGAAAAAT ACGATGTAGA ACGTTGTTCG
GCTCAAGTTT CCGATGAAAT CTGCTTTGGC TATAGTTCTA ATGAGATAGG TTTTGGTTGG
ACGAATGGAG TCATTTTAGA ACTATTAGCT AACCTTGACG ATTCATGA
 
Protein sequence
MTKNTVRTYI KQTWKTLTRS HQHLLESAQD TKLEHQPNTP WIIYISPQED CNTVKSVLER 
SLSTKEMQQI EIRTLPSEVE AIQEHGLLYL PGPYVVPGGR FNEMYGWDSY FILLGLLQDE
EWELAQSQVD QLLYQVQHYG TILNANRTYM LTRSQPPVLS MMVLALFQHT QDQAWLKSTL
PLLEQFYYYW VVPPHLNSAT GLSRYYALGE GAAPEVLFSE LDEAGRSHYE RVKEYYKTFE
IDDYDVSLFY DSEKDELTDL FYKGDRSMRE SGFDITNRFG PFSVDIVHYA PVCLNSLLYQ
MEQDLTQIHK ILDNPELAEQ WSDRANIRRE RINQYLWDEE KGIYLDYHFY SGKRRHYEFA
TTFYPLWTGL SSPEQAQRIV ENLSLFTAPG GIFTSTRVTG NQWDAPFGWA PLTLIAVQGL
YRYGYRKEGD DIAHKFLTMA IQEFTKYGFF VEKYDVERCS AQVSDEICFG YSSNEIGFGW
TNGVILELLA NLDDS