Gene Aazo_4860 details


Gene Information

Locus tag: Aazo_4860
Symbol:
ID: 9342667
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4971924
End bp: 4973159
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003723129
Protein GI: 298492952
COG category:
COG ID:
TIGRFAM ID:
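
The coordinate and length fields above are mutually consistent, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both assumptions agree with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4971924
END_BP = 4973159
GENE_LENGTH_BP = 1236
PROTEIN_LENGTH_AA = 411


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))   # 4973159 - 4971924 + 1
print(protein_length(GENE_LENGTH_BP))  # 1236 / 3 codons, minus the stop codon
```

Under these assumptions the computed values equal the listed Gene Length (1236 bp) and Protein Length (411 aa).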


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0351089
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACATC AACAAGGCAG GACGAATGGA AGCACAAATT CACAACTTCA GGGGGGAAGT 
AAAACCAAAT TTCCCCGCTT CTATTCTGAT GATAATTTAA AATTACCACA AAATTGGTCT
TACCTGCCTT TGACCTTATT TTTGGTCATT GGTTTGTTTG CTTGTGGTAA TCCTTCTCCT
CATAAACCAA ATAGAGGGGA TTCATCTAAC CAGGATACCA ATAGTAAGTT AACTTTTTTT
GGCGTTGCTT TAGAACAATT TGATGAAGTA GGTAGACCTA TTTGGAAAGT CAAAGCTAAA
AAGGCAAAAT ATACTAAAGA AAAAGAAATT GGTGAAGCAC AAAATCCCGA TGGTGAACTC
TACCAAGATG GTAAAGTAGT TTACACAATT AGAGGTGAAA CAGCTGATAT TCAGCAGGAT
GGAAAACAGC TATTCCTTAA AGGTAAGATT ATTGCTACCG ATCCCCATAA TGGTACTATC
TTAAAAGGTA ATGAATTAGA ATGGCGACAT AAAGAAGATT TATTAATTGT TCGTAATCAA
TTAAATGGGA CTCATAAAGA ACTACAAGCA ACCGCTCAAG AAGTAAGAGT AAAAACCCGG
GAACAACGAA TAGAATTTGC TGGTAAAGTA GTTGCGATAT CTGCTGATCC TCAGTTGCAA
ATGCGAACTG AAAGGTTAAT TTGGCAGATT AAAGAAGGAA AATTAATTAG CGAGTGCCCC
ATTCAAATTG ACCGCTATAA AGATAATAAA ATCACTGATC GTGGTCAAGG AAATGCTGCA
GAAATTAACT TAAAAACCAA AATTGCTACT ATTGGACCTA AAGCCAAACT AGAGTTAATA
GAACCACCTA TGCAGATAGT TAGTAACTCT ATGACCTGGA ATATCAATCA AGAAACTGTT
AAGGCAAATT CCCCTGTGCG TGTTTTTCAC CAAGCTGAAA ATGTGACTGT AACTGGCAAT
AAAGCAGAAG TAAAGATTCT ACAAAAAAGT GTTTATTTAA CAGGCAATGT GAATGCTGTA
GGACAACACA AGCAATCTTT AAAATCAAAT CTACTTACTT GGTATTTAGA AAGAAAATTA
CTAGAAGCCC AGGGGAATGT GGTTTATCTT CAAGTTGATC CACCGTTAAA TCTTCAAGGT
GCAACCGCAC TGGCTAATCT ACAAACAGAC AATATTGTTG TTAAAGGTGG CAGTTATAAC
GACAGAGTGG TAACAGAAAT TATTCCGCAG GAATGA
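
The gene sequence above is printed in blocks of 10 bases, so it has to be cleaned before any computation. The sketch below is one way to strip the whitespace and compute a GC fraction such as the "GC content" field in this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first row of the sequence, so its result need not match the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCAACATC AACAAGGCAG GACGAATGGA AGCACAAATT CACAACTTCA GGGGGGAAGT"
seq = clean_sequence(first_row)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing all sequence rows joined together to `clean_sequence` would give the input needed to recompute the gene-wide GC content.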
 
Protein sequence
MQHQQGRTNG STNSQLQGGS KTKFPRFYSD DNLKLPQNWS YLPLTLFLVI GLFACGNPSP 
HKPNRGDSSN QDTNSKLTFF GVALEQFDEV GRPIWKVKAK KAKYTKEKEI GEAQNPDGEL
YQDGKVVYTI RGETADIQQD GKQLFLKGKI IATDPHNGTI LKGNELEWRH KEDLLIVRNQ
LNGTHKELQA TAQEVRVKTR EQRIEFAGKV VAISADPQLQ MRTERLIWQI KEGKLISECP
IQIDRYKDNK ITDRGQGNAA EINLKTKIAT IGPKAKLELI EPPMQIVSNS MTWNINQETV
KANSPVRVFH QAENVTVTGN KAEVKILQKS VYLTGNVNAV GQHKQSLKSN LLTWYLERKL
LEAQGNVVYL QVDPPLNLQG ATALANLQTD NIVVKGGSYN DRVVTEIIPQ E
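
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes the standard codon assignments, which NCBI translation table 11 shares with the standard code for elongation (table 11 differs in its set of permitted start codons; this gene starts with ATG, so that difference does not matter here). The `translate` helper is hypothetical and stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, '*' = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence (whitespace allowed) up to the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence, copied from the record.
first_row = "ATGCAACATC AACAAGGCAG GACGAATGGA AGCACAAATT CACAACTTCA GGGGGGAAGT"
print(translate(first_row))  # MQHQQGRTNGSTNSQLQGGS
```

The printed residues match the first two blocks of the protein sequence listed above.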