Gene Aazo_2804 details


Gene Information

Locus tag: Aazo_2804
Symbol:
ID: 9340604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2886658
End bp: 2888595
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 41%
IMG OID:
Product: ATP-dependent metalloprotease FtsH
Protein accession: YP_003721776
Protein GI: 298491599
COG category:
COG ID:
TIGRFAM ID:
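
The length fields in the table above are consistent with each other, and that can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are illustrative only and do not belong to any database API.

```python
# Sketch: re-derive the length fields of this record from its coordinates.
# Function names are hypothetical; coordinates are assumed 1-based, inclusive.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2886658, 2888595)
print(length_bp)                  # 1938
print(protein_length(length_bp))  # 645
```

Both values match the Gene Length and Protein Length entries of the record.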


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGTAG AAACTAATAA TAATAACCAG ATGAAAACTC CGAAGATTCG GCAGTTTGGT 
GGTAGTTTTT TAATTTTACT GAGTTTGTTG TTGTTGTTGA ACTTGATAGT TCCGAGTTTT
TTTGGACAAC GTTTGCAACA AGTTCCTTAT AGTGATTTTA TTGCTCAGGT AGAAGCGGGT
AAAGTAGATA AAGCGGTTGT GGGGAGTGAT CGCATTGAAT ATGCTATCAA AACCCAAACA
CCAGAAGGTA AGACCGTTGA ACAGGTATTT AGAACCACAC CTGTAGCGAT TGACTTAGAT
TTACCCAAAA TTCTCCGCGA CAATAATGTC GAATTTGCCG CACCACCACG AAACGAAAAT
GCTTGGATTG GTACTGTATT GAGTTGGGTT GCACCTCCGT TAATTTTTTT TGGAATTTGG
GCGTTTTTAA TGAATCATCA AGGTGGTGGA CCTGCTGCGT TAACAGTAGG TAAAAGCAAA
GCACGGATCT ATTCTGAAGG TAGCACTGGG GTAAAATTCC TGGATGTAGC TGGTGTGGAT
GAAGCTAAAG CTGAATTAGA AGAAATCGTT GACTTTCTCA AAAATGCGAC TAAATACACC
AATTTAGGGG CGAAAATTCC CAAAGGTGTA TTGTTAGTTG GACCTCCAGG AACTGGTAAA
ACTTTATTAG CAAAAGCGAT CGCAGGTGAA GCTGGTGTTC CCTTCTTCAG TATTTCGGGT
TCTGAATTTA TCGAATTATT CGTTGGTGTC GGTGCTGCAC GAGTCCGGGA CTTATTTGAA
CAAGCTAAAC AACAAGCACC CTGTATCGTA TTCATTGATG AATTAGACGC ACTGGGTAAA
TCTCGCGGTG GTGCAGGTGG TTTCGTCGGT GGTAACGATG AACGAGAACA AACCCTGAAC
CAATTACTAA CAGAAATGGA TGGTTTTGAT GCCAACACCG GTGTAATTAT CATCGCTGCT
ACCAACCGTC CCGAAGTTCT CGATCCCGCT TTACGTCGTC CCGGACGTTT TGACCGTCAA
ATTGTCGTGG ATAGACCTGA TAAAATCGGT CGAGAAGCAA TTCTTAAAGT CCACGCTAGA
AATGTCAAAC TTGCGGAAGA TGTTGACTTA GGAATTATTG CTACTCGCAC ACCTGGTTTT
GCTGGTGCAG ATTTAGCTAA CTTAGTAAAT GAAGCAGCTT TGTTAGCAGC AAGAAATAAT
CGTCAAGCGG TACTCATGGC AGATTTTAAT GAAGCCATTG AACGTTTAAT AGCAGGTTTA
GAAAAACGTT CTCGTGTATT AAATGAAATC GAGAAGAAAA CCGTTGCTTA TCACGAAGTA
GGACACGCTA TCATCGGTGC ATTAATGCCT GGTGCGGGTA AAGTTGAAAA AATTTCTGTT
GTTCCCCGTG GTATTGGTGC ATTGGGTTAT ACCATTCAAA TGCCAGAAGA AGACCGCTTT
TTGATGGTGG AAGATGAAAT TCGCGGACGC ATTGCCACTT TATTAGGTGG ACGTTCTTCA
GAAGAAATCG TGTTTGGTAA AGTCTCCACT GGTGCTTCTG ACGATATTCA AAAAGCCACT
GATTTAGCAG AACGGGCAAT TACAATTTAT GGTATGAGCG ATAAACTCGG TCCTGTTGCT
TTTGAAAAAA TCCAACAGCA ATTTATTGAA GGTTATGGAA ATCCCCGACG TTCAATTAGT
CCCCAAATGA CGCAAGAAAT TGACCGGGAA GTGAAGGAAA TAGTTGATAA TGCTCACCAC
GTTGCATTAA GTATCCTGCA AAATAACCGC GATTTACTAG AAGAGATTGC CCAGGAACTA
TTGCAAAAAG AAATTCTCGA AGGTAGTTAT TTACGAGAAA GGTTAACTCA AGCCAAAGCA
CCAGATGAAA TGGATGAATG GTTGCGAACT GGTAAGTTAG ATGCTGATAA ACCTTTGCTG
CAAACTCTTT TGGTTTAG
 
Protein sequence
MPVETNNNNQ MKTPKIRQFG GSFLILLSLL LLLNLIVPSF FGQRLQQVPY SDFIAQVEAG 
KVDKAVVGSD RIEYAIKTQT PEGKTVEQVF RTTPVAIDLD LPKILRDNNV EFAAPPRNEN
AWIGTVLSWV APPLIFFGIW AFLMNHQGGG PAALTVGKSK ARIYSEGSTG VKFLDVAGVD
EAKAELEEIV DFLKNATKYT NLGAKIPKGV LLVGPPGTGK TLLAKAIAGE AGVPFFSISG
SEFIELFVGV GAARVRDLFE QAKQQAPCIV FIDELDALGK SRGGAGGFVG GNDEREQTLN
QLLTEMDGFD ANTGVIIIAA TNRPEVLDPA LRRPGRFDRQ IVVDRPDKIG REAILKVHAR
NVKLAEDVDL GIIATRTPGF AGADLANLVN EAALLAARNN RQAVLMADFN EAIERLIAGL
EKRSRVLNEI EKKTVAYHEV GHAIIGALMP GAGKVEKISV VPRGIGALGY TIQMPEEDRF
LMVEDEIRGR IATLLGGRSS EEIVFGKVST GASDDIQKAT DLAERAITIY GMSDKLGPVA
FEKIQQQFIE GYGNPRRSIS PQMTQEIDRE VKEIVDNAHH VALSILQNNR DLLEEIAQEL
LQKEILEGSY LRERLTQAKA PDEMDEWLRT GKLDADKPLL QTLLV
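
The protein sequence is the translation of the gene sequence under translation table 11. As a minimal sketch, the code below builds a codon table in the usual TCAG ordering and translates the first and last blocks of the gene sequence. It relies on table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ in which codons may act as starts). The `translate` helper is hypothetical and simply stops at the first stop codon.

```python
from itertools import product

# Codon table in NCBI TCAG order; the assignments below are the ones
# shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bp and last 18 bp of the gene sequence in this record.
print(translate("ATGCCTGTAG AAACTAATAA TAATAACCAG"))  # MPVETNNNNQ
print(translate("CAAACTCTTT TGGTTTAG"))               # QTLLV
```

The two outputs match the first ten and last five residues of the protein sequence listed in the record.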