Gene Aazo_4610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4610 
Symbol 
ID9342416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4715834 
End bp4718200 
Gene Length2367 bp 
Protein Length788 aa 
Translation table11 
GC content45% 
IMG OID 
Productpeptidase M23 
Protein accessionYP_003722973 
Protein GI298492796 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000163265 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAACGAG CATTGAAGAA GAGAGTAAAG GCTGTGTTAA ACAATAACCC CAACAGCGAT 
GCTGCCCCGG TCGAGCAGTT AAATGAGACA AATCACAAAG CAAACTGCCG GGCTAGAACC
CAAGCCGCTA TGATAGGCTT GGCAATCTCA ATGGGCGCAA CCAGCCTTTT GGTGACTCGA
CAAAGCGATC AAGCCCAAGC AGCGGCTCCT GTTGGCAGTC AAAAAGCAGC CTCAACGATT
CCTGCTGTTT CTGACACTGA GATAAAATTT GCCGCCACCA AGCTGGAGTC CCAAGCAGTC
TCTTCAGCGA GGTTGCCTGA AAATCCTGTT ATTCTGGAAC CAACAGCAGT TTCGCAAGTG
CCTGGGCTTG AAGCTAAATG GCCGATTACA GTCAACGAAA TGGCTGTCCA AATTCCTACA
TCAGAAGCTG ACAAAAATGC TATCTACTTG GAATCCCACG TAAATCAGGG ATTTAAGACT
AACTCAGTTC AATCCAGATT ACAAACAGCC AACCAATTAC CTGAAATCAG AGTACAAAAA
CTATCCAGCA GTCAAGGTAT TGCAGGTTCT CAACCCACAA CCGTAATTGT GGAATCAGCA
AACACAGTCA GCAATGATGT TGATGCACAA CTGAAAGCGC AGCAAGAATT TGCCCTGAAT
CGCTTACAGG AAAAATCGAA CCGATTAAAA AATCGTTTAG CCGAGTTGCG GTCTAACGAT
ACTCAAAAGC TACCACAAAC TCGTAAGATT GAAATAGCAC AACCAACAAC TGTAGCTGAC
AAAACTGTAG CAGCACAATC GGAAGAAGCG ACAGATAGCA GCCAAGCTAG TTTGATATCC
AGGTTAAAAG AGGAAAAACA AACAAGTGCA GGAATGCAGA AGGTAACATC ATCTGCACCA
GCCGCACCTA AAGTTGTTGC CCAATCATTT GGGACAACCT ATGAAGTCAG ACCTGGAGAT
ACATTAGCTG AGATTGCCAG TAATTACGGT ATTTCAGTCT CAGAACTAGT TAAAGCTAAT
AATCTAGCAG ACCCCAATCA ATTACAAATC AGTCAACAAC TGATTATTCC CACAGCTATA
GCAGAACGGA GCAGTTACGA CCAGTCAACT CTAGCAGTTC AGCGTACTTT CGTCCAGTCC
AACAATATTG CCACAGTTGC TAGTTCTCCT CTGAACAAAC TTGATTTAGA TGCCAGTCTA
CCTCAAGTCT CACAGCCATC AGGAGTCAGA AATAGCAACT CTGCTGTTAT TTCTATTCCC
AAAGTTAAGG AAAACCAATT TCAGGTCAAT ACTCCGGCTG CTACCACCAA TAGCGTTAGC
ATTACTGTCC CTGTAGCTCA CGATAATGAG CAACAGTCAG GAGAAAATGA CCTATCTTCC
CCTATGCCTT ATAGTGTTGG TGGTGAAACT CCATCACGCG ATCGCGTCAC CAAAATCCCA
ACGGCTCAAA AACAACCTCA AAAGGTAGCC AAGGTCAAAG GTAACGAACG TCTCCGCAGC
TTACAAGTAG AAATTGAGAG ACTACGCGAA AAATATCGTC CTCAAGAGTC TGGTGTAACT
GTACAAACAC CGAGTGCAAG CGATGATTCC ACAGCGCCAG TTGCAATCGA AATAGGCAAT
CTATTTGCCC CCTCTAGAGT TAATTCTCAA CGGAATGCGG TAGCAATTCC TGTACCAAGA
CCAATACTAC CCAGTTACAG AGAACAACCT GTTAAGCCCC AATTGCGTGC TGCTCGTCCC
ATGAACGAGC CAATTAACCC AGAATTCCTG CCCAATCAAA CAACTTCATC TGGTAATTTT
GATCGCAATA ACTCATCTGG TATCAGATTG GTTGTTCCTT CTCCTAGTCT TAACTCGACC
GACTCCTTGG GTAAATTGCG AGGAACCAGA GTATCCCCAG CATTACCACC ATTAGCGGCA
GTTGATTTAT ACCTGCCTAA AAATGTTGAC CAAAATAGTA ATCCTCAATC TACCTCTTCA
GCATCTTACA TTTGGCCTGC TAAGGGTGTT CTCACATCTG GCTATGGTTG GCGCTGGGGC
AGAATGCACA GAGGTATTGA CATTGCTAAC GGTGTAGGCA CACCAATTTA TGCATCTGCT
CCCGGTGTGG TAGAAAGAGC AGGTTGGAAT AACGGTGGTT ATGGCAATGT GGTTGATATC
CGTCATCCCG ATGGTAGTAT GACTCGTTAT GGTCACAATA GCCGAATTTT AGTGCAAGTA
GGTCAACAAG TGGAACAAGG GCAAACTATT GCTGCTATGG GTAGCACTGG TTTTAGCACT
GGACCTCACA GCCACTTTGA AGTCCACCCA GCAGGTAAGG GTGCAGTGAA CCCAATTGCG
TTCCTACCAT CACAGGCACG TCTATAA
 
Protein sequence
MKRALKKRVK AVLNNNPNSD AAPVEQLNET NHKANCRART QAAMIGLAIS MGATSLLVTR 
QSDQAQAAAP VGSQKAASTI PAVSDTEIKF AATKLESQAV SSARLPENPV ILEPTAVSQV
PGLEAKWPIT VNEMAVQIPT SEADKNAIYL ESHVNQGFKT NSVQSRLQTA NQLPEIRVQK
LSSSQGIAGS QPTTVIVESA NTVSNDVDAQ LKAQQEFALN RLQEKSNRLK NRLAELRSND
TQKLPQTRKI EIAQPTTVAD KTVAAQSEEA TDSSQASLIS RLKEEKQTSA GMQKVTSSAP
AAPKVVAQSF GTTYEVRPGD TLAEIASNYG ISVSELVKAN NLADPNQLQI SQQLIIPTAI
AERSSYDQST LAVQRTFVQS NNIATVASSP LNKLDLDASL PQVSQPSGVR NSNSAVISIP
KVKENQFQVN TPAATTNSVS ITVPVAHDNE QQSGENDLSS PMPYSVGGET PSRDRVTKIP
TAQKQPQKVA KVKGNERLRS LQVEIERLRE KYRPQESGVT VQTPSASDDS TAPVAIEIGN
LFAPSRVNSQ RNAVAIPVPR PILPSYREQP VKPQLRAARP MNEPINPEFL PNQTTSSGNF
DRNNSSGIRL VVPSPSLNST DSLGKLRGTR VSPALPPLAA VDLYLPKNVD QNSNPQSTSS
ASYIWPAKGV LTSGYGWRWG RMHRGIDIAN GVGTPIYASA PGVVERAGWN NGGYGNVVDI
RHPDGSMTRY GHNSRILVQV GQQVEQGQTI AAMGSTGFST GPHSHFEVHP AGKGAVNPIA
FLPSQARL