Gene Ava_4055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4055 
Symbol 
ID3681676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5040544 
End bp5041677 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content46% 
IMG OID637719406 
Productmolybdate metabolism transcriptional regulator 
Protein accessionYP_324554 
Protein GI75910258 
COG category[K] Transcription
[P] Inorganic ion transport and metabolism 
COG ID[COG1476] Predicted transcriptional regulators
[COG1910] Periplasmic molybdate-binding protein/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00182658 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.26319 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAGG ATAGTGGCCT CCTTAATAAC TTGAAAGCAA TCAGAACGCG CTTAGGGATG 
AGCCAGCAAG ATTTGGCTAA CATTGCTAGT GTAACTCGTC AGACCATTAG TGGTGTGGAA
TCGGGACAAT ATGCCCCCTC AGTGGCGATC GCACTACGCA TAGCTAAAGC ACTTGGTTGT
CAAGTTGAGG ATCTATTCTG GCTAGACCAA GACTTACCTA CAATTGAAGC AGTCCTCACA
AAACCCATAC CCGCAGATCA GTCCATACGC CTGAGTTTAG CAAGGGTTGG CGGACAATGG
GTCGCTTATC CCCTATTGGG TAAAGATGCC TTTCGCCAAG ATATGATTCC GGCAGATGGG
GAAGGAGTGA GCCAAACAGG AAGCGGTAAA GTGCAAGTGA GGCTGTTAGA TGATAACTTA
GCCGCACTGC ATAACACAGT AGTGATTGCA GGTTGCTCGC CTGTAATTGC ACTCTGGGCA
AGAGCTACCG AACGCTGGCA TCCACAACTG CGAGTACATT TTACCTTTGC CAACAGCATA
GATGCCCTGC AAAGTCTATG CAGAGGTGAA GCGCACATTG CTGGGATGCA CCTATATGAT
CCCAAAACTG ACGAACATAA CGTGCCGTTT GCCCGTGAAA TCTTAGCAGG AAGGGAAGCT
GTTTTAGTAA CTCTAGGTTT ATGGGAAGAA GGGCTGATAG TCCCATCGGG TAATCCCAAG
GGTTTTAAAA CACTAAACGA TGTAGTAGAA GCACAAGCAA CCATCGTCAA TCGTGAAGTT
GGTGCTGGTA GTCGAATGCT TTTAGAGCAA AAACTGCAAC AAGAACACAT ACCATTTGCA
GCAGTCAAAG GATTTGAGCA GATTGCCACT AGCCATCAAG ATGTTGCCCA AGCCGTCGCA
CTAGGGTTTG TGGATGCAGG TATTAGTACA GCATCCGTCG CTGCTACCTT TGGCTTAGGA
TTTGTACCTC TGCATCAATC AAGATATGAT TTAGTGATTC TGAAGGAATA TTTAGAAGAA
GCACCTATAC AACAATTGTT GAGTACCTTG GGACATCGCA TGGTTCACTC GCAACTAGAA
GTCCTCGGTG GCTATGACAT TACAAAAATT GGGGAAGTTG TAGCAACGGT TTAG
 
Protein sequence
MKQDSGLLNN LKAIRTRLGM SQQDLANIAS VTRQTISGVE SGQYAPSVAI ALRIAKALGC 
QVEDLFWLDQ DLPTIEAVLT KPIPADQSIR LSLARVGGQW VAYPLLGKDA FRQDMIPADG
EGVSQTGSGK VQVRLLDDNL AALHNTVVIA GCSPVIALWA RATERWHPQL RVHFTFANSI
DALQSLCRGE AHIAGMHLYD PKTDEHNVPF AREILAGREA VLVTLGLWEE GLIVPSGNPK
GFKTLNDVVE AQATIVNREV GAGSRMLLEQ KLQQEHIPFA AVKGFEQIAT SHQDVAQAVA
LGFVDAGIST ASVAATFGLG FVPLHQSRYD LVILKEYLEE APIQQLLSTL GHRMVHSQLE
VLGGYDITKI GEVVATV