Gene Ava_4606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4606 
Symbol 
ID3679956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5761449 
End bp5762552 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content40% 
IMG OID637719961 
Producthydrogenase expression/formation protein HypE 
Protein accessionYP_325098 
Protein GI75910802 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0309] Hydrogenase maturation factor 
TIGRFAM ID[TIGR02124] hydrogenase expression/formation protein HypE 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTTT CAAAAACTAA CTCAGTATCA AATCCCCTGT TTCAAAAAAT CGAACAAGTC 
CGCCGTCGCC CAGCTAAAGT AAAAGATACT CACATTAATT TGGCACATGG TAGCGGCGGT
AAAGCCATGC GCGACTTAAT TGATGATATT TTTGTTAGTA ATTTTGATAA TCCTCTTCTC
TCACAATTAG AAGACCAAGC CAGCTTGGAT TTATCTAGTC TTTTACAACA GGGAGATAGG
CTAGCTTTTA CCACAGATTC CTATGTTGTA GACCCCTTAT TTTTTCCTGG TAGTGATATA
GGAGAATTAG CAGTAAATGG CACAGTTAAT GACTTAGCTA TGAGTGGTGC TAAACCCTTA
TATCTTACCT GTAGTGTAAT TTTAGAAGAG GGATTACCAA CAGAAACCCT CCGCCGTGTC
GCTACCAGCA TGAAAATAGC CGCCCAAAAA GCTGGAGTGC AAATTGTCAC AGGTGACACT
AAAGTTGTTC ATCGTGGTTG TGCGGATAAA CTCTTTATTA ACACTGCTGG TATTGGTATT
ATTCCCAGTG GCGTTAATAT TTCTGCACAC AATATTCAAC CAGGAGATGC CATAATTGTT
AATGGTGAGT TAGGCAATCA TGGGGCAGCA ATTTTAATCG CTCGTGGGGA ATTAGCTTTA
GAAACAAATA TAGAAAGTGA CTGTCAACCG TTGCATGATT TAGTCGCAAC TATTCTCCAT
GTATGTCCCC AAGTTCACGC CATGCGGGAT GCTACACGAG GAGGTTTAGC AACAGTTTTA
AATGAATTTG CCCTAACTTC TAATGTAGGA ATACGCATTC ATGAAGCATC TATACCTGTG
CGTGAAGAAG TCAAAGGAAT TTGTGAAATT TTGGGTTTAG ACCCTTTATA TTTAGCTAAT
GAAGGCAAAT TAGTAGTAGT GGTTAGGGCT GAACAAGCAC ACACAGTTTT ATCTGCAATG
AAGTCTCACC CGGTGGGTAA AGATGCGTGT ATTATTGGTG AGGTTATTGC TTCTCCTCCA
GGAGTAGTGT TTCTAAAAAC TACTTTTGGT ACAGAACGGA TTATTGATAT GCTAGTAGGC
GATCAACTAC CACGTATTTG TTGA
 
Protein sequence
MDFSKTNSVS NPLFQKIEQV RRRPAKVKDT HINLAHGSGG KAMRDLIDDI FVSNFDNPLL 
SQLEDQASLD LSSLLQQGDR LAFTTDSYVV DPLFFPGSDI GELAVNGTVN DLAMSGAKPL
YLTCSVILEE GLPTETLRRV ATSMKIAAQK AGVQIVTGDT KVVHRGCADK LFINTAGIGI
IPSGVNISAH NIQPGDAIIV NGELGNHGAA ILIARGELAL ETNIESDCQP LHDLVATILH
VCPQVHAMRD ATRGGLATVL NEFALTSNVG IRIHEASIPV REEVKGICEI LGLDPLYLAN
EGKLVVVVRA EQAHTVLSAM KSHPVGKDAC IIGEVIASPP GVVFLKTTFG TERIIDMLVG
DQLPRIC