Gene Aazo_5220 details


Gene Information

Locus tag: Aazo_5220
Symbol:
ID: 9343027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 5343136
End bp: 5345031
Gene Length: 1896 bp
Protein Length: 631 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: ATP-dependent metalloprotease FtsH
Protein accession: YP_003723376
Protein GI: 298493199
COG category:
COG ID:
TIGRFAM ID:
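
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `check_cds_lengths` is hypothetical and not part of any database tool.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumptions (not stated in the record): coordinates are inclusive,
    and one terminal stop codon is excluded from the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive coordinate span
    if span != gene_length_bp or span % 3 != 0:
        return False
    return span // 3 - 1 == protein_length_aa  # codons minus the stop codon


# Values copied from the record: Start bp, End bp, Gene Length, Protein Length.
consistent = check_cds_lengths(5343136, 5345031, 1896, 631)
print(consistent)  # True
```

Under those assumptions, 5345031 - 5343136 + 1 = 1896 bp, and 1896 / 3 = 632 codons, of which 631 encode amino acids.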


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.991823
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAATC TTGGTAAGAA GGCATTGAGA AAACAGCTTC AAACAAAGCG AATTCCTTGG 
ACTGGGGCAT TAGCAGCTAC TATGATGATT CTGCCAGGAA TTTTTGGTGG TAGTCCCGTC
TTGGCTCAAA AAGTAGAGCG TAACTCTTTA TCTTATGGGG AATTACTCCA AAAAACTGAG
CAAGGGGAAG TGAGAAAAGT AGAACTTGAT GAAACTGAAC AAATAGCTAA GGTTTATTTG
GCGGATCAAA AGCCAGATGC ACCACCTATA CCAGTAAGAC TTTTAGAACA AAATACGGAG
TTAATCAATA AACTCAAGGA AAAAAACGTT GAATTCGGTC AGGTTTCCTC CGCTAACAGC
AGGGCTGCTG TAGGGCTATT AATTAATATG ATGTGGATTT TGCCACTGGT AGCTTTAATG
CTGTTGTTCC TGCGTCGTTC TACTAATGGT TCTAACCAAG CAATGAATTT CGGTAAATCT
CGCGCTCGAT TCCAAATGGA AGCGAAAACT GGAGTGAAAT TTGACGATGT AGCTGGAATT
GAGGAAGCAA AGGAAGAATT ACAGGAAGTT GTAACTTTCC TCAAACAGCC AGAAAAATTT
ACTGCTGTGG GCGCTCGTAT TCCTAAAGGT GTACTGTTAG TTGGACCTCC AGGCACTGGT
AAAACTTTAC TAGCTAAGGC GATCGCAGGG GAAGCTGGTG TACCATTTTT CTCAATCTCT
GGTTCTGAAT TTGTAGAAAT GTTTGTCGGT GTGGGTGCTT CCCGTGTCCG CGACTTGTTC
AAGAAAGCTA AAGATAACGC TCCCTGTATC ATCTTTATTG ATGAAATTGA CGCAGTAGGT
AGACAACGGG GTGCAGGTAT TGGTGGTGGT AACGATGAGA GAGAACAAAC CTTAAACCAA
CTATTAACAG AAATGGATGG TTTTGAAGGT AACACAGGGA TTATTATTAT TGCTGCTACC
AACCGTCCTG ATGTTTTAGA CTCTGCATTG TTGCGTCCTG GTCGTTTTGA TAGACAGGTG
ACAGTTGATG CACCGGACAT TAAAGGACGT TTGGAAATTT TAGAAGTCCA TTCCCGCAAT
AAGAAATTAG ATCCTAGTGT ATCCTTAGAT GCCATCGCCC GTCGCACACC TGGCTTTACA
GGTGCAGATT TAGCCAACCT CCTCAACGAA GCAGCCATTC TCACTGCTAG AAGACGCAAA
GACACAATTA CCATCCTAGA AATTGATGAT GCAGTAGACA GAGTAGTAGC TGGGATGGAA
GGTGCAGCTT TAGTAGATAG CAAAAACAAA CGTTTAATCG CCTACCATGA AGTGGGACAC
GCTTTAGTGG GAACTTTAAT TAAAGACCAT GACCCAGTAC AAAAAGTTAC CCTAATTCCC
AGAGGACAAG CATTAGGTTT AACTTGGTTT ACACCCAATG AAGAACAAGG ATTAATTTCT
CGTTCCCAAA TCCTAGCGCG GATTATGGCA GCTTTAGGCG GTCGTGCTGC TGAGGAAATT
GTTTTTGGTA AAGCAGAAGT CACAACAGGT GCTGGAAATG ACTTGGAACA AGTGACAAAT
ATGGCACGAC AAATGGTAAC GCGATTTGGG ATGTCTGACT TAGGTCCGTT GTCCTTGGAA
ACTCAACAAG GAGAAGTATT CTTGGGGCGT GACTGGGGGA ATAAATCAGA ATATTCTGAA
GAAATTTCTT CTAGAATTGA TTCTCAAGTC CGGGGAATTA TTAGCAGTTG TTACATCAAA
GCCAAAGGAA TTCTCCAAGA AAACCGCATA ATTTTAGAAC GTTTAGTTGA TTTATTAGCA
GAACAAGAAA CTATTGATGG TGATTTATTC CGCAAAATTG TTGAAGAAAA TACACAGGTT
CAGGTAAAAG GCCAAAGGTT AGCTGTGCCT CATTAG
 
Protein sequence
MTNLGKKALR KQLQTKRIPW TGALAATMMI LPGIFGGSPV LAQKVERNSL SYGELLQKTE 
QGEVRKVELD ETEQIAKVYL ADQKPDAPPI PVRLLEQNTE LINKLKEKNV EFGQVSSANS
RAAVGLLINM MWILPLVALM LLFLRRSTNG SNQAMNFGKS RARFQMEAKT GVKFDDVAGI
EEAKEELQEV VTFLKQPEKF TAVGARIPKG VLLVGPPGTG KTLLAKAIAG EAGVPFFSIS
GSEFVEMFVG VGASRVRDLF KKAKDNAPCI IFIDEIDAVG RQRGAGIGGG NDEREQTLNQ
LLTEMDGFEG NTGIIIIAAT NRPDVLDSAL LRPGRFDRQV TVDAPDIKGR LEILEVHSRN
KKLDPSVSLD AIARRTPGFT GADLANLLNE AAILTARRRK DTITILEIDD AVDRVVAGME
GAALVDSKNK RLIAYHEVGH ALVGTLIKDH DPVQKVTLIP RGQALGLTWF TPNEEQGLIS
RSQILARIMA ALGGRAAEEI VFGKAEVTTG AGNDLEQVTN MARQMVTRFG MSDLGPLSLE
TQQGEVFLGR DWGNKSEYSE EISSRIDSQV RGIISSCYIK AKGILQENRI ILERLVDLLA
EQETIDGDLF RKIVEENTQV QVKGQRLAVP H
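
The gene and protein sequences above are related through translation table 11, and the GC content field is derived from the gene sequence. The sketch below illustrates both calculations in plain Python, assuming the sequence is pasted in with the 10-base spacing shown above. The helper names `clean`, `gc_content` and `translate` are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, so a standard codon table is used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard genetic code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_bases = "ATGACAAATC TTGGTAAGAA GGCATTGAGA"
print(translate(first_bases))  # MTNLGKKALR
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.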