Gene Aazo_4118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4118 
Symbol 
ID9341923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4185725 
End bp4187356 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content38% 
IMG OID 
Productfamily 1 extracellular solute-binding protein 
Protein accessionYP_003722682 
Protein GI298492505 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.794373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACGGT GGGGTAGAAT AGCGAAATTT CTATCTTTAT TCTCTATCTG TTTATTGTTG 
ACTGTAAGCT GTACCCCTCC TCAACAGATA ACTACTCCAA CATCTGGTGC TGTCAATACT
CCTGCCAGTG ATGGACGGAT TACTATCGGT ACGACTCAGA AACCTCGTAC CCTTGATCCG
GCTGATGCGT ATGAATTAGC ATCTATGGGT TTGGTGTTTA ATATGAGTGA TCGCCTATAT
ACTTACGAAC CAGGGAGTAT AGAAATTAAA CCCCAACTGG CTACAGCTTT ACCTAAAGTT
AGTGCAGATG GTTCAACATA CACCATACCT ATACGTCAAG GAGTGCTCTT TCACGATGGT
ACACCTTTTA ACGCTAAAGC AATGGAATTT ACCATCCAGC GTTTTATCGA AAATAAAGGT
AAACCATCTT TCTTACTATC AGATACGGTA GATTCAGTGA AAGCTACAGG GGATTATGAA
TTAACAATTA AGCTGAAAAA GCCCTTTGCA GCTTTTCCTT CACTGTTAGC ATTTTCTGGA
GTGTGTGCTG TCTCTCCGAG AGCTTACGAA ATAGGTGCAG GTAAATTTCA ACCCAATATC
TTTGTGGGAA CTGGCCCTTA TAAATTAGCC CAGTATGGGA CTGATTCTCT CAGATTTGAT
GTATTTGATA AATATTGGGG AGAAAAACCA GCGAATAAAG GTATTAATGT CCAGATTCAA
AGCAGTCCAG TGAATTTGTT CAATGCTTTT AAAACTAGTG CGGTAGACGT TGCTTATCTA
TCTTTACAAC CAGACCAAAT TCGTAGTTTA GAAGAAGGTG CTAAAAAAGG AGATTGGCAA
ACCATCACTG CCCAAGGTAG TGTAGTGAGT TATATGGTGT TGAATCGCAA TCAGAAACCT
TTGGATAAAC CAGAAGTTAG AAGAGCGATC GCATCACTCA TTAATCGTCA ATTATTCAAT
GAGCGAGTTT TGTTTAATCA GGCAGATCTA CTTTACACCA TGATTCCCAC TACCTTTAAT
GTTTCCCAGC CATTATTTCA AGCTAAATAT GGTGATGGTA ACTTTGAAGA AGCTAAAAAG
TTGTTAACTA CCGTTGGTTT TTCCCAACAA AATCCCGCTA AAGTGCAAGT TTGGTATCCT
GCGAGTTCAC CAACTCGAAG TTTAGCCGCA CAAACACTCA AATCCTTGGC TGATACTAAA
ATGGATGGGA TATTACAATT GGAAGTAAAA ACCGTAGAAG GTGCTACATT TTTTAAAGAA
ATTTCCAAAA GTTTATATCC AGTAGCTTTA CTAGATTGGT ATCCAGACTT TTTAGACCCA
GATAATTACG TACAACCATT TTTAGCTTGT GAAAAAGGTT CAGAATCAAA AGGCTGTGAA
GACGGAGGAA GTCAAATGCA AGGGTCATTT TACTATAGCG AAACAATGAA TAAACTCATT
GATCAACAAC GTAAGGAACA AAACCCAGAA GCCAGAAAAA GAATATTTAC TGAGATTCAA
AGCCAAGTAG TTAATGATGT CCCTTATATT CCTTTATGGC AAAACAAAGA TTATGTATTT
GCCCAAAAAG GTGTAAGTAA TGTAAAACTT GATCCTACCC AGAACTTGAT TTACAAGAAC
ATTAAAAAGT AG
 
Protein sequence
MTRWGRIAKF LSLFSICLLL TVSCTPPQQI TTPTSGAVNT PASDGRITIG TTQKPRTLDP 
ADAYELASMG LVFNMSDRLY TYEPGSIEIK PQLATALPKV SADGSTYTIP IRQGVLFHDG
TPFNAKAMEF TIQRFIENKG KPSFLLSDTV DSVKATGDYE LTIKLKKPFA AFPSLLAFSG
VCAVSPRAYE IGAGKFQPNI FVGTGPYKLA QYGTDSLRFD VFDKYWGEKP ANKGINVQIQ
SSPVNLFNAF KTSAVDVAYL SLQPDQIRSL EEGAKKGDWQ TITAQGSVVS YMVLNRNQKP
LDKPEVRRAI ASLINRQLFN ERVLFNQADL LYTMIPTTFN VSQPLFQAKY GDGNFEEAKK
LLTTVGFSQQ NPAKVQVWYP ASSPTRSLAA QTLKSLADTK MDGILQLEVK TVEGATFFKE
ISKSLYPVAL LDWYPDFLDP DNYVQPFLAC EKGSESKGCE DGGSQMQGSF YYSETMNKLI
DQQRKEQNPE ARKRIFTEIQ SQVVNDVPYI PLWQNKDYVF AQKGVSNVKL DPTQNLIYKN
IKK