Gene Information |
Locus tag | Aazo_0998 |
Symbol | |
ID | 9338793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 1055961 |
End bp | 1057373 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | |
Product | mammalian cell entry related domain-containing protein |
Protein accession | YP_003720493 |
Protein GI | 298490316 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0397137 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAGTC TAATTAGCGG CTTCACGTCT ACACGAACTT TTAGAGAAGG CTCAGTGGGA TTATTACTTT TACTGGGTTT AGGAGCATTT GGAATAATTC TCCTATGGTT AAATAGAATC CCCCTTGGAC GCAGTTCTTA TAAAGCTGTG GTGGAATTTG CTAACGCTGG GGGAATGCAA AAAGGCTCAC CAGTTCGTTA TCGTGGCGTA AAAGTTGGTA GTATTTCCAA TATTAAAACC GCAGTCAATG CTGTTGCTGT AGAAATTGAA ATCAACGATC CTAACTTGAT AATTCCCGCA GATTCTAAAA TTCAAGCCAG TCAAACTGGA TTAATTAGCG AAAGTATTAT TGATATTACC CCAATAACCA ACCTAGCGAC GGGAACTAAT ATTGCTAAAC CCTTAGACAA AGATTGTAAT CCCAGTCTGA TTATTTGTAA TGAAATCAGT ACATTAAAAG GTCAAATTGG TATCAGCGTT GATGAACTGA TTCGTCAATC ATCTGATTTT ACGGCTCAAT ATAATAACAA GGAATTTTAT CAAAACGTTA ATCGCTTGTT AGTAACCTCT GCATCAGCAG CTTCTAGTGT TGCTAACCTC AGTCGAGAAC TCCAGAGTGT GAGCAAAAGC TTTAAAGGGC AAATCGGCAC ATTTTCTAAT ACTGCTGTCA CCATCCAGAA AGCTACAAAT GAACTCACTA CAACTACATC TAAAACCGCA AATCAATTAG GTGAAACAGC CAGCGAGTTT AGTAAAACTG CACAACAAGC AGGTAGTTTG TTGAATAACT TAGATGAATT ATTGACAACA AACCGTTCGT CCCTAGTTAG GACTTTAAAT AATATTACTC AAACTAGTAA CCAACTGCGT CAGACAGTTA GTGGTTTATC ACCTGCGGTT AATCGTTTAA ATGAAGGGGG ATTATTGAAT AATTTAGAAC TTTTGTCTGC GAACGCTGCG GAAGCTTCAA CTAATTTAAA AGACGCATCC AAGACCTTAA ATAATCCTAA AAATATTGTC TTACTTCAAC AAACCCTAGA TGCTGCCAGG GTGACATTTG AAAATACCCA AAAAATTACA TCTGATTTAG ATGAATTAAC AGGCGATCCT CAATTTCGCC AAAATCTCCT GCAGTTGGTG AATGGTTTGA GTAAATTAGT ATCTTCTACA CAGGATATGC AGGAACAAGC AAAGGTAGCT GTCACCTTAG ACTCTCTCAA AGCATCTATG AACCAGGCAG AACTCTTAAC TTCTATCCCC GTCAAAAAAG TTGAAGTAGA AAAACCAGAA TTTATCACCC CCACCCCAAT TGAAAAAGCT GATGTAATTC AGTTAGATTT AGAAATATCT ACACCACCGG AAACTCCTCT TGGGGAAGCA GGGGAGCAGG GAGCAGGGAG CAGGGGGAGA TAA
|
Protein sequence | MRSLISGFTS TRTFREGSVG LLLLLGLGAF GIILLWLNRI PLGRSSYKAV VEFANAGGMQ KGSPVRYRGV KVGSISNIKT AVNAVAVEIE INDPNLIIPA DSKIQASQTG LISESIIDIT PITNLATGTN IAKPLDKDCN PSLIICNEIS TLKGQIGISV DELIRQSSDF TAQYNNKEFY QNVNRLLVTS ASAASSVANL SRELQSVSKS FKGQIGTFSN TAVTIQKATN ELTTTTSKTA NQLGETASEF SKTAQQAGSL LNNLDELLTT NRSSLVRTLN NITQTSNQLR QTVSGLSPAV NRLNEGGLLN NLELLSANAA EASTNLKDAS KTLNNPKNIV LLQQTLDAAR VTFENTQKIT SDLDELTGDP QFRQNLLQLV NGLSKLVSST QDMQEQAKVA VTLDSLKASM NQAELLTSIP VKKVEVEKPE FITPTPIEKA DVIQLDLEIS TPPETPLGEA GEQGAGSRGR
|
| |
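The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is an untranslated stop codon (the gene sequence above ends in TAA). The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1055961
end_bp = 1057373

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is the stop codon and is not translated.
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Leftover bases after splitting into codons: {remainder}")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1413 bp) and Protein Length (470 aa) fields listed above.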