Gene Aazo_4933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4933 
Symbol 
ID9342739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5048630 
End bp5049850 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content41% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003723189 
Protein GI298493012 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.342711 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCTA ACCTCGTCAA TCTGCCAAAT TCCACACCTG CTTTTGATGG TGATATTTCA 
ACCGAAGACT TATCAGATAT TAACTCTGAG ACTTTATTGC AACTGCTTTG TCAGGAAATG
CAGAGTCAGG TAAAAGCTTC AACTGGATGT GTGCAAGCCG TAACTAAACG CATAGCTAAA
GAAGTAGAAC GGATATGTGA TAAAAGTTCC CGCATCCAAA CTTCAGGGCA AGTTAGGTCT
TGGCAGATTA CTTTAGCAAG ACACCGTTTA CAAAAGTGCC TGCGTTACTA TCAATTAGGC
TCAAAACAAG GGCGTGTAGA ATTACATAGT AGTTTAGGTG CTATTGTTTA TCGCCATGTT
ACTGTTGCTG GCTCAGAATT AGGTTTTGAA GCTCGTTACA ATCTGATTGA AGATTTTCTG
CAAGCTTTTT ATATTGAAGC GATTAAAGCT TTTCGCAGAG AAAATGAATT AGCTGAAGAT
TACACACCAC GTACTCAACT ACAATTAGCT GAGTATATGG CTTTTACGGA GCAGTATGCT
AAACGGCGGA TTAATTTACC TGGTGGTGCT AATCAACAGT TGATTGTGTT ACGCGCTCAA
GGTTTTGCTC GTCGTCAACC CCAAGAAACG ACTGTAGATA TTGAAATGGC TGTGGATTCA
GCTAAGACTG AAGAGGCAGA ATCTTATCAA CGTAATTTGG CCGTGCAACA AATTAGGTCA
CAGATGGTTG CTAAACCTAA TTTTGATCCA TCTGAGGAGT CGGAACGCGA TCGCGTGATT
ACAGAGTTGA TGAAATATCT GGAATCTCAA GGTCAAGCTG ATTGCATGGA TTACCTGTCT
CTTAAACTTC AGGATCTCTC AGCACCGGAA ATTGACCAAA TTTTAGGATT AACTAGCCGT
CAGCGCGATT ATTTGCAACA ACGCTTTAAG TATCACGTTG AGAAGTTTGC TAAACAACAC
CACTGGCAAC TAGTACATCA ATGGCTGGGT GCTGGTTTAG AACATAAGTT GGGTTTATCT
TCTCAGCAGT GGGATGCTTT TTGGAATCAA CTCACAGAAC AGCAACAGCA AATCTTTCAG
CTAAAAACTC TAATGGAGAA TGATCAAGTG ATCGCTAAAG CTGTCCAATG TACCCCTAAA
CAACTACAAA AACGCTGGAC TCAAATGCTA GAACTCGCAT GGGCTATCCG CAATGGTCAT
GCTGAAGTTA AAACCTGCTG A
 
Protein sequence
MKANLVNLPN STPAFDGDIS TEDLSDINSE TLLQLLCQEM QSQVKASTGC VQAVTKRIAK 
EVERICDKSS RIQTSGQVRS WQITLARHRL QKCLRYYQLG SKQGRVELHS SLGAIVYRHV
TVAGSELGFE ARYNLIEDFL QAFYIEAIKA FRRENELAED YTPRTQLQLA EYMAFTEQYA
KRRINLPGGA NQQLIVLRAQ GFARRQPQET TVDIEMAVDS AKTEEAESYQ RNLAVQQIRS
QMVAKPNFDP SEESERDRVI TELMKYLESQ GQADCMDYLS LKLQDLSAPE IDQILGLTSR
QRDYLQQRFK YHVEKFAKQH HWQLVHQWLG AGLEHKLGLS SQQWDAFWNQ LTEQQQQIFQ
LKTLMENDQV IAKAVQCTPK QLQKRWTQML ELAWAIRNGH AEVKTC