Gene Ava_4042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4042 
Symbol 
ID3682171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5027389 
End bp5028519 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content45% 
IMG OID637719394 
Productmolybdate metabolism transcriptional regulator 
Protein accessionYP_324542 
Protein GI75910246 
COG category[K] Transcription
[P] Inorganic ion transport and metabolism 
COG ID[COG1476] Predicted transcriptional regulators
[COG1910] Periplasmic molybdate-binding protein/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.805509 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.419577 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTAG ATATCAATAT CTGCAACAAC ATCAAGTCTA TCAGAACCCG TTTAGGCATG 
AGCCAGCAAG ATTTGGCTAA TATAGCTGGC GTAACCCGTC AGACGATTAG TGGTGTGGAA
TCTGGACAAT ATGCGCCTTC TGTCGCCATT ACTTTACGTT TAGCTAAAGC ACTGGGTTGT
CAAGTCGAAA ACTTATTCTG GTTAGAAGAT GATTTACCTG AAATTGAAGC AGTTCTGGCT
AAACCAGTCC CCACTGGACA ACAACTGAGA GTGAGTCTAG CAAAGGTTGG CGGTCAATGG
ATAGCTTATC CTCTCGTGGG CAAGGAAGCT TTTCGGATGG AAATGATTCC TGCTGATGGA
AGAGCAGAGA GTCAGACACA TACGAATAAA GTCCAGGTGC GGCTGCTTGA TGATTTGGAC
AAATTACAAA ATACAGTCGT CATTGCTGGT TGTACACCAG TTATTTCCTT CTTAGCCAGA
GCGACTGAAC GCTGGCATCC CCAGCTACGC GTCCATTATC ATTTTGCTAA TAGTATGGCT
GCCTTGCGTA GTTTAAATCG GGGTGAAGTC CACATTGCAG GGATGCACTT GTACGATCCG
CAAACAGGAG AACATAACAT TCCTTTTGCG CGAGAAGCTT TGGATGGTAG AAGTGCAGTT
TTAATCACTC TGGGCATTTG GGAAGAGGGA CTAGTAGTTG CACCGGGGAA TCCAATGGAA
ATCAAATCCT TATCTGACTT GGTGGAACTA GAAGCCACAA TCATTAACCG CGAACCTGGT
TCTGGTAGTC GGATGCTATT AGAACGCAAA CTCAAAGAGG AAAAAGTATC AGCAAACACA
ATTAAAGGAT ATGAGCATAT TGTCCATAGC CATCAGGATG TAGCCTTATC TATCGCCGCA
GGTATCGCTG ATGCGGGGGT GAGTACGGCT TCTGTAGCGG CGGCTTTTGG TTTGGGATTT
ATCCCCTTGC ATCAAGCGCG GTATGACTTA GTGATTCTCA AGGAATACTT GGAAGAAGCA
CCAGTACAGC AATTTTTGAG TATCTTGGGA CATCGATTAG TGCAATCACA ATTAGAAGTT
CTAGGCGGCT ATGACATCAG CGACATTGGT GAAGTTGTCG CAACTATTTA G
 
Protein sequence
MKLDINICNN IKSIRTRLGM SQQDLANIAG VTRQTISGVE SGQYAPSVAI TLRLAKALGC 
QVENLFWLED DLPEIEAVLA KPVPTGQQLR VSLAKVGGQW IAYPLVGKEA FRMEMIPADG
RAESQTHTNK VQVRLLDDLD KLQNTVVIAG CTPVISFLAR ATERWHPQLR VHYHFANSMA
ALRSLNRGEV HIAGMHLYDP QTGEHNIPFA REALDGRSAV LITLGIWEEG LVVAPGNPME
IKSLSDLVEL EATIINREPG SGSRMLLERK LKEEKVSANT IKGYEHIVHS HQDVALSIAA
GIADAGVSTA SVAAAFGLGF IPLHQARYDL VILKEYLEEA PVQQFLSILG HRLVQSQLEV
LGGYDISDIG EVVATI