Gene Ava_4114 details

Gene Information

Locus tag: Ava_4114
Symbol:
ID: 3681502
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5128608
End bp: 5130233
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 46%
IMG OID: 637719461
Product: thiamine pyrophosphate enzyme
Protein accession: YP_324609
Protein GI: 75910313
COG category: [G] Carbohydrate transport and metabolism; [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG3961] Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes
TIGRFAM ID:
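
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp, end_bp = 5128608, 5130233

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon, so it adds no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1626 0 541
```

Under these assumptions the result agrees with the reported gene length of 1626 bp and protein length of 541 aa.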


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.178401
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCAAC TAGCACCGCA TATATTTGAT ATTTTATACC AAAAAGGTGT AGAACACGCA 
TTTGGTATAC CTGGGGATTT TGCCTTAACA CTATTCGATG CACTAGCAGA CAGCAAGATT
GCACCTATTG TCATGACGCA CGAACCCTGC GTAGGCTTTG CGGCTGATGC TTACTCCCGA
ATGCGGGGTT TGGGTTTAGC GGTTGTTACC TATAGCGTTG GCGGCTTAAA TATGGTGAAT
GCTGTAGCTG GTGCTTATGC CGAGAAGTCT CCGTTGGTCA TTTTAAGCGG CGGGCCAGGT
GTGCGAGAAC AAAAGGAGCA TGACTTGTTA CATCATAAGG TGAAAACTTT TGACACACAA
CGGCGTGTTT ATGAAGAAGT CACCCTTTAC GCTACAAAGC TAACCGATCC AAAAACAGCC
GATGCTAAAA TTCATCATGC TCTTGACTAT GCAACGACAT TCAAGCGTCC TGTTTATTTG
GAAATTCCCC GTGATCTAGT TTATGCGGAG ATTACAGAGT CAGAACACTT GCCACCACCA
ATCAAGCGCA CTGATCCAGA TACTTTAACG GAAGCCATTG CAGAAACTCT GGAGATGCTC
AAGCGATCGC ACTCACCAGT CATTCTTGCT TGTGTTGAAG TTCATCGCTT TGGCTTGCAG
GAACAACTTC TAGCTTTGGC AGAAAAACTT GGTGTACCAG TCTGCTCAAC AATGTTAGGA
AAATCTGTTT TTCCCGAAAG ACATCCGCAA TACATCGGTA TCTACAATGG TGAGGCTGGG
GATTTAAATG TGCAGAAAAT CGTTGAAGAA TCAGACTGTG TGCTGATGTT GGGAGTGTTT
ATGACTGATA TCAACCTGGG GATGTTTACC GCTCATCTCA ACCCAGGTTT TACAGTGTAT
GCAACTTCAG AACGCCTCGC CATTAAGCAT CACGAATACC CCAATGTGAG GTTTGAAGAT
TACATCACCA CCTTGCTTGA TAGTCCAGAT TTGCCTCATT GGGATTCATC CGGCATCTAT
ACTATGAAAC CGCGTGTTAC GCCATCTGTG GGTAAAATTT CCATGAGTGG ACTACTTTAT
GAACTCAATC AGTTCATTGA CAGTAACACT CTACTAGTGA CGGATGTTGG AGATGCCCTA
TTTGCAGCAG ATGATATCCA GACACAGCAA GGCACATCAT TCTTATGTCC CGCCTTCTAC
GCCAGTATGG GGTTTGGCGT TCCTGGGGTG ATTGGCGCAC AATTAGCCGA CCCATCCCGA
CGAGCGATCG CTTTGGTTGG CGATGGGGCG TTTCACATGA CAGGAATGGA ATTGCTCACA
GCGCAACGCT TAAGACTAAA CCCGATTGTC ATAGTCATTA ACAATGGCTC ATTTGCTAGC
CTCCAGGCAA TGGGACATCA AGAAGCGGCT TTTGTCCAAA TACCCACAAT GGACTATGCC
CAGTTAGCTA ACGTTCTTGG TGGTCATGGC TTTGTGATCC ACACTAGTAC ACAGCTACAA
CAAGCCTTAC AGACAGCGCA AAATAGTAAG ACTTTCAGTA TTCTTGATGT TCATCTCTCA
CCTGACGATG TTTCGCCTGC CTTACAAAGA TTGAGCGCAC TATTTACCAA ATCCCTCAAA
GGATAA
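
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be cleaned before its length or GC content can be computed. The following is a rough sketch of one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed row is used as sample input, so the printed fraction describes that row and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed row of the gene sequence (sample input only).
first_row = clean_sequence(
    "ATGCCTCAAC TAGCACCGCA TATATTTGAT ATTTTATACC AAAAAGGTGT AGAACACGCA"
)

print(len(first_row), round(gc_fraction(first_row), 3))
```

Passing all displayed rows through `clean_sequence` in the same way should give a string whose length can be compared with the Gene Length field and whose GC fraction can be compared with the GC content field.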
 
Protein sequence
MPQLAPHIFD ILYQKGVEHA FGIPGDFALT LFDALADSKI APIVMTHEPC VGFAADAYSR 
MRGLGLAVVT YSVGGLNMVN AVAGAYAEKS PLVILSGGPG VREQKEHDLL HHKVKTFDTQ
RRVYEEVTLY ATKLTDPKTA DAKIHHALDY ATTFKRPVYL EIPRDLVYAE ITESEHLPPP
IKRTDPDTLT EAIAETLEML KRSHSPVILA CVEVHRFGLQ EQLLALAEKL GVPVCSTMLG
KSVFPERHPQ YIGIYNGEAG DLNVQKIVEE SDCVLMLGVF MTDINLGMFT AHLNPGFTVY
ATSERLAIKH HEYPNVRFED YITTLLDSPD LPHWDSSGIY TMKPRVTPSV GKISMSGLLY
ELNQFIDSNT LLVTDVGDAL FAADDIQTQQ GTSFLCPAFY ASMGFGVPGV IGAQLADPSR
RAIALVGDGA FHMTGMELLT AQRLRLNPIV IVINNGSFAS LQAMGHQEAA FVQIPTMDYA
QLANVLGGHG FVIHTSTQLQ QALQTAQNSK TFSILDVHLS PDDVSPALQR LSALFTKSLK
G
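
The relationship between the two sequences can be illustrated with a small translation sketch. It assumes that the amino-acid assignments of translation table 11 are the same as in the standard genetic code (table 11 differs in which start codons it allows, which this sketch does not model). The function name `translate` is hypothetical and no external library is used.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First displayed row of the gene sequence.
first_row = "ATGCCTCAAC TAGCACCGCA TATATTTGAT ATTTTATACC AAAAAGGTGT AGAACACGCA"
print(translate(first_row))  # MPQLAPHIFDILYQKGVEHA
```

The output matches the first two blocks of the protein sequence shown above, and translating the final row `GGATAA` gives `G` followed by a stop codon, in line with the last residue of the protein.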