Gene Ava_4452 details


Gene Information

Locus tag: Ava_4452
Symbol:
ID: 3680386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5573021
End bp: 5576197
Gene Length: 3177 bp
Protein Length: 1058 aa
Translation table: 11
GC content: 43%
IMG OID: 637719806
Product: glycoside hydrolase family protein
Protein accession: YP_324945
Protein GI: 75910649
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0383] Alpha-mannosidase
TIGRFAM ID:
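
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminating stop codon does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(5573021, 5576197)
print(length)                      # gene length in bp
print(protein_length_aa(length))   # protein length in aa
```

With the values listed for Ava_4452, this yields 3177 bp and 1058 aa, matching the Gene Length and Protein Length fields.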


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCTA CCGAATCTCA GTATCAGACA GATTTAATCT CAGACACAAT TGAGAAATTA 
CGCAGTTGTT GTCAAGTTAA TGTTCAATCT ACTTGGTTAT ACCAGGACTC GAACACAGAA
ATTACTGGCG TTGCTACATC TAGTATATCC AATTGGCAAC CTGTAGAGTT AGATACCAAG
GGTAACATTG CTTGGACTGG TGGACAGCAA GTACTTTGGC TAGAACAGAA ATTCGTAGTT
CCCCAAAATT TACATGATTA TCCTTTGGCG GGGTTGTCTT TGCGGCTGTC TCTACTTTGG
TGGGCGGACT CTGCCAAAAT CTACGTGAAT GGGCAATTAG TGCTAGAAGG AGATTTATTT
GATTGTTCCC CCAGAGTTTT ACTCAGTCGG GAAGTATCAC CAGGACAAGA ATTTGTAGTG
GCTTTGCGGC TGGCGAGTCC TGGTCATTGT GATGGTGCTT TAGTGCGATC GCTCCTTGTC
TACGAGTCTA CAGATTATAA TTATCCTGAC CCCGGTTTTA TTGCCGATGA GTTAGCAATA
TTACAGCTTT ATTTAGAAAA GTTCGCCCCA GAAAAGTTAA ATATCCTCAC ACAAGCAATC
CCAGAAATTC ATCCCTCCAA CCCAGAATCT CTAGTTACCT TCCGTCAAAA CCTGATAAAT
CATCTCTCTA TCAGCGACCC AAAATTTAAA ATCTATTTAT TAGGTCACGC TCATTTAGAT
TTAGCATGGC TATGGTCAGT CAGTGAAACT TGGAATGCAG CCCAAAACAC TTTTACATCA
GTCCTCAAAC TACAACAAGA TTTTCCCGAA TTAATCTTCT GTCATTCCAC CCCAGCCCTG
TATGCTTGGA TTGAAGAACA TCGTCCAGAT TTATTTACAG CGATTCAACA AGCTGTAGCT
GCCAAAAAAT GGGAAGTTAT CGGCGGTTTT TGGGTAGAAC CTGACCTTAA TTTAATCGCT
GGGGAATCCA TAGTCCGTCA GTTACTATAC GGTCAACGCT ATTTCCAAGA AAAATTTGGC
AAACTGACGA CTGTAGTTTG GGTTCCCGAC ACCTTTGGTT TTTGTGCAAC CCTACCGCAA
TTTTTGGCGA ATGCAGGTAT TGAGTATTTC GTGACGCAAA AATTACGATG GAACGATACT
ACTAAATTTG ATTATGGGGC TTTTTGGTGG CGATCGCCTG ACGGTAGTCA AGTATTAAGT
GTTATGTCTG CAACCATAGG CGAAGGCATT GACCCCATCA AAATGGCAGC CTATTCTCTA
GAATGGCAAA CCCAAACCTG CTTAACTCAA TCTTTATGGC TTCCTGGTGT CGGTGACCAC
GGCGGCGGCC CCACCCGTGA TATGTTAGAA ACCGCCCAAC GCTGGCAAAC TTCACCATTT
TTCCCAGACC TAGAATTTAT CACCGCCGAA AAGTATCTCC AGCAAATTCA GTCAACGGTC
AATGGTCAAC AGTCAATAGT CAATAGTCAA CGGTCAACAT TTCCTATATG GAATGATGAA
CTATATCTAG AATTTCATCG TGGTTGTTAC ACTACCCACG CAGACCAAAA ACGTTGGAAT
CGGCATTCTG AAAATTTACT ATACGAAGCC GAACTATTTG CTACCTTGGC AACTTTTATC
TGTGGCGTGA CATATCCCAA ATCCGACATC GAAACAGCTT GGAAGCAAGT ATTATTTAAC
CAATTCCACG ATATTTTACC TGGTTCTTCC ATTACCCAAG TATATACAGA TGCCTTGCCC
GAATGGCAGC AAGTCGAACA AACGGGAACC AAAATATTAA AAGAATCATT ACAGGCGATC
GCATCTCACT TTACTCTACC AGAGCCACCA AAAACCGATA GTCTACCCAT TTTCGTTTTC
AATTCTCTCA ACTGGCAGCG CTCTGAGGTA GTATCAGTCA CCCTACCCCC ACCACCACCT
AACCAACAAT GGCAAGTCTA CGATACTACT GGCAAACAAA TCATTTCCCA ATTAACTGAA
CCATCAACCA TACTATTCCT CGCCGAAGAT ATTCCCTCTG TAGGCTATCG CCTCTTTTGG
CTTTCCCCCA CATCGCCCAC ATCTTCCACA TCGCCCACAT CTTCCACATC CCTAGACTAT
ATTCTCGAAA ATGAACACCT GCGCGTTATT GTAGATCCTG ATACTGGAGA TTTATCAAGT
ATCTATGACA AAACTCATCA ACGAGAAGTA TTGTCTGGTG CGGGTAATCA ACTACAAGCT
TTTAAAGACA GTGGTCAATA TTGGGATGCT TGGAATATCG ACCCCAATTA TAGTCAGCAT
CCCCTACCAG CAACTAACCT CAAATCTATC CAGTGGTTAG AACAAGGAAC TGTACAGAAT
TGTCTCCGCG TAGTGCGTCA ATTGGGTAAG TCGGAATTTT GCCAAGACTA TATCTTGCAA
GTCGGCTTAC CCCAATTGAA AATCGTTTCT AGAGTCAATT GGCAAGAAAA GCACGTTTTA
GTCAAAGCAG CGTTTCCTCT CAACGTTACA GCCGACTTTG CCACCTACGA AATTCCCTGC
GGTGCAATTC GTCGCCCGAC TCAACCCCAA ACCCCGCAGG ATAAAGCAAA ATGGGAAGTC
CCAGCTTTAC GTTGGGCTGA TTTAACAGCA GAGACAGATG AGGGTCTTTA CGGTGTTAGT
TTACTGAATG ATTGTAAATA TGGTTACGAC AGTCAACCCC AGCTATTAAG GCTAACCTTA
CTCCGTAGCC CTACTTGGCC TGACCCAGAA GCTGACACAG GCGGCATACA CGAATTTGCT
TATACTGTGT ATCCTCACGC TGATAGCTGG GAATCAGCCC ATACAGTACA AAAGGGATAT
GAATTAAACA TTCCCCTGCA AGTAATATTA AACCCAACTC AACACTTCCA ACTCAACACT
TCCAAATCAA CACCAAACAC CAGAGACAAA GCAAGTTTTT TAAATTTACC AGCCGAGAAT
CTGGTCTTGA TGGCTGTCAA ACCATCGGAA GACGACCAGC AGCAATTAAT TCTGCGCTTT
TATGAATCTC ATGGTGTGAC TACAGAATTA TCTTTGCAGA GCGATTTAAA GTTAACCTTG
GGTATTCCAG TAGATTTACT GGAACGCCCC ATTAGCCAAT TCTCATCTGG GCAACAAATC
TCCACAATTG AACCTTGGAA AATTGCGACT TTTAAAGTTT TAGAGGTCAG AGGCTAG
 
Protein sequence
MTPTESQYQT DLISDTIEKL RSCCQVNVQS TWLYQDSNTE ITGVATSSIS NWQPVELDTK 
GNIAWTGGQQ VLWLEQKFVV PQNLHDYPLA GLSLRLSLLW WADSAKIYVN GQLVLEGDLF
DCSPRVLLSR EVSPGQEFVV ALRLASPGHC DGALVRSLLV YESTDYNYPD PGFIADELAI
LQLYLEKFAP EKLNILTQAI PEIHPSNPES LVTFRQNLIN HLSISDPKFK IYLLGHAHLD
LAWLWSVSET WNAAQNTFTS VLKLQQDFPE LIFCHSTPAL YAWIEEHRPD LFTAIQQAVA
AKKWEVIGGF WVEPDLNLIA GESIVRQLLY GQRYFQEKFG KLTTVVWVPD TFGFCATLPQ
FLANAGIEYF VTQKLRWNDT TKFDYGAFWW RSPDGSQVLS VMSATIGEGI DPIKMAAYSL
EWQTQTCLTQ SLWLPGVGDH GGGPTRDMLE TAQRWQTSPF FPDLEFITAE KYLQQIQSTV
NGQQSIVNSQ RSTFPIWNDE LYLEFHRGCY TTHADQKRWN RHSENLLYEA ELFATLATFI
CGVTYPKSDI ETAWKQVLFN QFHDILPGSS ITQVYTDALP EWQQVEQTGT KILKESLQAI
ASHFTLPEPP KTDSLPIFVF NSLNWQRSEV VSVTLPPPPP NQQWQVYDTT GKQIISQLTE
PSTILFLAED IPSVGYRLFW LSPTSPTSST SPTSSTSLDY ILENEHLRVI VDPDTGDLSS
IYDKTHQREV LSGAGNQLQA FKDSGQYWDA WNIDPNYSQH PLPATNLKSI QWLEQGTVQN
CLRVVRQLGK SEFCQDYILQ VGLPQLKIVS RVNWQEKHVL VKAAFPLNVT ADFATYEIPC
GAIRRPTQPQ TPQDKAKWEV PALRWADLTA ETDEGLYGVS LLNDCKYGYD SQPQLLRLTL
LRSPTWPDPE ADTGGIHEFA YTVYPHADSW ESAHTVQKGY ELNIPLQVIL NPTQHFQLNT
SKSTPNTRDK ASFLNLPAEN LVLMAVKPSE DDQQQLILRF YESHGVTTEL SLQSDLKLTL
GIPVDLLERP ISQFSSGQQI STIEPWKIAT FKVLEVRG
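
The gene and protein sequences above are related by translation table 11, and the GC content field is a simple base count over the gene sequence. The following is a minimal sketch of both calculations, not the method the database itself uses. It assumes the sequence is pasted in as printed (blocks of ten separated by spaces) and contains only A, C, G and T. The names `translate` and `gc_fraction` are hypothetical.

```python
# Amino acid assignments for NCBI translation table 11, listed in
# TCAG codon order. Table 11 shares these assignments with the
# standard code and differs only in its set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Alternative start codons (such as GTG or TTG) are not converted to M
    here; this gene starts with ATG, so that case does not arise.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence, as printed above.
print(translate("ATGACCCCTA CCGAATCTCA GTATCAGACA"))  # MTPTESQYQT
```

The first thirty bases translate to MTPTESQYQT, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the listed GC content.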