Gene Aazo_3853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3853 
Symbol 
ID9341658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3905669 
End bp3906802 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content45% 
IMG OID 
ProductXRE family molybdate metabolism transcriptional regulator 
Protein accessionYP_003722489 
Protein GI298492312 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.593506 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAGG AGAACCACCT CCGTAATAAC TTGAAGTCTA TTAGAACTCG CTTGGGTATG 
AGCCAGCAGG AATTGGCAAA TCTTGCTGCT GTAACTCGTC AAACTATTAG TGGTGTTGAA
TCAGGGCTAT ATGCTCCTTC TGTAGCGATT TCATTGCGCC TAGCTAAAGC TCTTGGTTGT
CAAGTGGAGG AACTATTCTG GCTAGAGCAC GATTTACCTC AAATCGAAGC GGTGCTTACC
AAACCTGTTA ACAACCTCCA ACAATTAAGA GTGAGTTTAG CGCGAGTGGG AGGTCAATGG
ATAGCTTATC CATTGGTTGG TAAGGATGCT TTTCGTCAAG ATATGATTCC TGCTGATGGT
GAAGGTGAGA GGCTGACAGA TAGCAATAAG CTGAATGTCC GTCTGCTCGA TGATAATATG
GACAGGCTTT ATAACACAGT TGTAATTGCT GGGTGTTCGC CTGTGATTTC CCTCTGGGCT
AGAAGTACAG AACGGTGGCA TCCTCAACTT CGGGTACAAT ACAACTTTGC TAATAGTATG
AGGGCATTGC ACAGTTTATG CAGAGGTGAG ACGCATATTG CCGGGATGCA TTTATATGAT
GTGGAAACGG GAGAATATAA CACTCCGTTT GTGCGGAAGG TGCTGGTGGG AAAGGAAGCA
GTAATCATCA CTCTCGGAGT TTGGGAAGAA GGGTTGATGG TACAAGCTGG GAATCCAAAG
CAAATTAAAA GTATCACTGA TGTAGTGGAG ATGGGTGCAG CGATCGCCAA TCGTGAGGTG
GGTTCTGGTA GCCGATCGCT ATTAGAGCAA ACTTTACAAC AGGAGAAAAT ACCATTCCAA
TCACTGCAAG GGTTTGAGTG GATTATGAAT AGTCATCAAG AGGTGGGATG GGCGATCGCA
TCTGACATGG TGGATGCTGG TATTAGTACA GCCTCTGTTG CCATTGCCTT TGGACTAGGA
TTTGTCCCCC TACGTCGGTC ACGATATGAT TTAGTCATTC TGAAAGAATA TATGCAAGAG
CCACCTGTAC AACAATTACT GAGTACTCTC GGACATCGGC TGGTTCACTC ACAATTACAA
ATCCTCGGTG GGTATGATAT TAGCCAAATC GGCGCAGTTG TGGCCACAAT TTAA
 
Protein sequence
MKQENHLRNN LKSIRTRLGM SQQELANLAA VTRQTISGVE SGLYAPSVAI SLRLAKALGC 
QVEELFWLEH DLPQIEAVLT KPVNNLQQLR VSLARVGGQW IAYPLVGKDA FRQDMIPADG
EGERLTDSNK LNVRLLDDNM DRLYNTVVIA GCSPVISLWA RSTERWHPQL RVQYNFANSM
RALHSLCRGE THIAGMHLYD VETGEYNTPF VRKVLVGKEA VIITLGVWEE GLMVQAGNPK
QIKSITDVVE MGAAIANREV GSGSRSLLEQ TLQQEKIPFQ SLQGFEWIMN SHQEVGWAIA
SDMVDAGIST ASVAIAFGLG FVPLRRSRYD LVILKEYMQE PPVQQLLSTL GHRLVHSQLQ
ILGGYDISQI GAVVATI