Gene Ava_4602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4602 
Symbol 
ID3679952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5756792 
End bp5759152 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content40% 
IMG OID637719957 
Producthydrogenase maturation protein HypF 
Protein accessionYP_325094 
Protein GI75910798 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0068] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00143] [NiFe] hydrogenase maturation protein HypF 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.879827 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACTG AGGAAATTCG AGTTCGCGGT ACTGTCCAGG GGGTGGGTTT TCGTCCTACG 
GTGTACCGTC TGGCTAAGGC TTGTGGTTTG CGGGGGGATG TTTGTAATGA TGGGGAAGGG
GTGTTAATTC GGGTATGTGG TGACGAAGGG GCTTTGGCGG AGTTTGTTGT TAGATTGCAG
GAAGAATGTC CACCACTGGC GAAAATTAAT GAACTCGTCA GGACACCATA TCAAGGTGAG
TTGAACTTCC ATGATTTTGT TATTTCTCAT AGTGTTAGTG GCGTAGTGAG AACGGAAATT
AGTCCTGATG CGGCTACTTG TGTTGAATGT CAGCGAGAAA TTTTTGACCC CTTTAGCCGT
TTTTACCGTT ATCCCTTTAC TAACTGTACT CATTGCGGCC CCCGGTTGAG TATTATTCGT
GCCATTCCCT ATGACAGATG TAATACCAGT ATGTCTGCGT TTGTCATGTG TTCTGAATGT
GAAAAAGAAT ATCATGATAT CGAAAACCGC CGCTTCCACG CCCAACCTGT CGCCTGTCAC
ACCTGCGGCC CTAAAGCTTG GCTAGAACGG GCTGATGGTA AACCTGTGAC GGCTTCTATG
TTTTCTATGT TGGATGATGT GGATGCTGTT TGTACTTTGT TACAGAAGGG TGAAATTGTC
GCTATTAAGG GAATTGGTGG TTTTCATTTA GCTTGTGATG CTACCCAAGA AGCGGCTGTG
GAGAAATTGC GTCAGCGTAA GCGAAGATAT GATAAGCCTT TTGCTTTGAT GGCACGGGAT
ATTACTGTTA TTGAAAAATA TTGCTTTGTT AATGATAAGG AACGGGAGTT ATTAGATAGT
CCTACTGCAC CGATTGTTTT GTTAGGAAAA AGGACAGAGG AAATAAGTTC TTTCCCTATC
GCCTCCACTG TCGCACCTGG ACAAAATATT CTTGGTTTCA TGCTACCTTA TACGCCGTTG
CATCATTTGA TGCTCAAGCG GATGAATCGC CCAATTGTTT TGACAAGTGG CAATATTTCT
GATGAACCAC AATGTATTGA TAATGATGAA TCGAAAGAAA AGCTAAGTCA TATAGCTGAT
TATTTTCTTC TACATAATCG GGAGATTGTG AATAGAGTTG ATGATTCAGT TGTGCGGGTG
ATTGATAATA AAATACAAAT CATGCGTCGT GCTAGAGGTT ATGCGCCAGC AACGATAAAA
TTACCACCAG GATTTGATAA TATTCACCAA ATTTTAGCGA TGGGTAGTGA GTTAAAAAAC
ACCTTTTGTT TATTGAGAGA TTATGAAGCA ATTCTATCTC AACATTTGGG AGATTTAGAA
AATGCTTTGG CGTTTAATTG TTATCAGGAT ACATTGAATT TGTATTTAAA TTTGTTGCAA
CACAAACCAG AGGCGATCGC CGTTGATTTA CACCCTGAAT ATTTATCCAC AAAACTTGGT
CAAGAATTAG CCGCCGCCAA CCAGATTCAA CTGCAATATA TTCAACATCA TCACGCCCAC
ATTGCCGCTT GTATGGCAGA AAATCTTATT CCTTTAGATT CTTTGCCAGT CTTAGGTATT
GCTTGTGATG GACTAGGTTA CGGTGCAGAT GGTAAACTTT GGGGAGGAGA ATTTCTTTTA
GCTGATTATA CTCAATTCCA ACGCCTGGCT ACATTTAAAC CAGTGGCAAT GATTGGTGGT
GAGCAAGCAA TTTATCAGCC TTGGCGTAAT ACCTATGCTC ATTTATTAAG TGCCAATCTT
TGGGATAATT GTCAATTAAA TTACCATGAC TTAGAAATTA TTAAATTTTT ACAAAAACAG
CCGATAAATT TACTCAATCA ACTTATAGAA CAAAGTATTA ATACTCCTTT AGCTTCTTCT
GTAGGAAGGC TCTTTGATGC TGTGGCTGCG GCTATCGGCA TTTGTCCAGA AAAATGTAGC
TACGAAGGAC AAGCAGCGAT CGCCCTAGAA TCTATAGTCG ATATCCACAC ATTAAATAAT
CCTAAAGAAA CAGCAGTTTA TCCCTTTCAG GTTACTTTTT CCGATAATAT TTATTGTATA
GACTCATGCT CCATGTGGCA ATTATTGCTT GATGACTTAC AGCAGCAAAC TCCTCAACAA
GTTATTGCTG CTAAATTTCA TTTAAGTTTG GCTAATGTCA TTGTGGAGAC AGTCAAACAT
CTTCGTCAAC AAAACTTATT TAATCAAGTC GCCCTAACAG GAGGAGTGTT TCAGAATAGT
ATCTTATTAC AACTAGTCAC TAAGCAGTTA CAAAACTTAG AAATTAACGT ACTTACTCAT
AGCTTAGTTC CTACAAATGA CGGTGGTTTA TCACTAGGTC AAGCAATTAT CACAGCCGCC
AGATTAATGA AAAACTCTTG A
 
Protein sequence
MATEEIRVRG TVQGVGFRPT VYRLAKACGL RGDVCNDGEG VLIRVCGDEG ALAEFVVRLQ 
EECPPLAKIN ELVRTPYQGE LNFHDFVISH SVSGVVRTEI SPDAATCVEC QREIFDPFSR
FYRYPFTNCT HCGPRLSIIR AIPYDRCNTS MSAFVMCSEC EKEYHDIENR RFHAQPVACH
TCGPKAWLER ADGKPVTASM FSMLDDVDAV CTLLQKGEIV AIKGIGGFHL ACDATQEAAV
EKLRQRKRRY DKPFALMARD ITVIEKYCFV NDKERELLDS PTAPIVLLGK RTEEISSFPI
ASTVAPGQNI LGFMLPYTPL HHLMLKRMNR PIVLTSGNIS DEPQCIDNDE SKEKLSHIAD
YFLLHNREIV NRVDDSVVRV IDNKIQIMRR ARGYAPATIK LPPGFDNIHQ ILAMGSELKN
TFCLLRDYEA ILSQHLGDLE NALAFNCYQD TLNLYLNLLQ HKPEAIAVDL HPEYLSTKLG
QELAAANQIQ LQYIQHHHAH IAACMAENLI PLDSLPVLGI ACDGLGYGAD GKLWGGEFLL
ADYTQFQRLA TFKPVAMIGG EQAIYQPWRN TYAHLLSANL WDNCQLNYHD LEIIKFLQKQ
PINLLNQLIE QSINTPLASS VGRLFDAVAA AIGICPEKCS YEGQAAIALE SIVDIHTLNN
PKETAVYPFQ VTFSDNIYCI DSCSMWQLLL DDLQQQTPQQ VIAAKFHLSL ANVIVETVKH
LRQQNLFNQV ALTGGVFQNS ILLQLVTKQL QNLEINVLTH SLVPTNDGGL SLGQAIITAA
RLMKNS