Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_2145 |
Symbol | |
ID | 9339945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 2228318 |
End bp | 2229676 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003721287 |
Protein GI | 298491110 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.657234 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGCCAC TATTACTCGG TTTGACTCTC GCTCAAGCAA ATCAATCCTC ACCACCACCT GAAGAAGTGG TACAACCACA AGAAGTTCGA GCGTTATCAG GACAGTTAGA TACTGTGCCA GTTTTTAATA GCAATAGCCC AGAATTGGTA TTAAAAGAAG GTATATTACT TTCTACCTTT CCCCCGAACG GTAAAAAGGT ACCAACTGCA CATTTAAATT TTGCTTTTCG GGGACGCTTT GATATTTTTG CTCATCATGT TGCTAAAGCA GAACCACCAG AAAGTTTGCG TTCTTTGTAT TTAGGCATCA TTTTGCATAA CCCTGGAACT CAGCCAGTAA AGGTGAATAT ATTACAGGGG GCAAGTTATT TAAGTCAACC GGATGCACCA TTTATTAAGT TGGATGCTTT TATCCCTAAT AATGCAGGTA CGGTGTTTGC CGGACCGGGT AGTCGTGTGA TGTCTGATGT GCTGGGGGGA AGACGACAAG GAATTTTCCC TGCTCAAATT GTCATTCCTC CTGGGGAAAG TCATATGTTA TTAAATCTAC CTATTCCGGT GCAAGGATTG ACACCACCAT TAAATGGCCG TTCTTCATTT ATGCGTCTGC GGAGTAATGG TACTGTGTAT GCTGCTAGTC TGGCTATGTT TGCACCAACT AATAAAGATG GTAGTGAACG TGCGCCAACT TTAGGAGAAT GGCAAAATCT ACTGAATAAT GGTGAATTAT CCACACCTAG AGATAAAGTT CCCACTCCTT TAGAGGAGAC TGGTAAACTG AGAATTTATG GAAGAGTGGC AGGTGTAGCG AGTGGTTCGG TGTGGCGATC ACTCTTGGTA GATAGTCCTA AAACTAATTA TTTAACTATT CCCCAACCTG GTCAAGCTTT TTCTTATGTT TTAAGCACAG TGGATGGTGG TACTTTGGGA ACTGGTCAAA TTCAAAGTGC ATCTATGTTA GTGCGTTATC CTGATACAGC TTATCGCGCC CATGGCAATT ATGGAGTTCA ATATAGTTTG AAGTTGCCTT TGTACAACAA TACACAAAGT CACCAGAAGG TGAGTGTGTT GGTGCAAACC CCAATTAAAG AAGATCAATT AAGTCAGTCA GGGTTGCGCT TTTTCACTAG ACTAGCACGT CAAGTTTTCT TCCGGGGAAC TGTGCGAATT CGGTATAAAG ATAATCAAGG TCAACCAAAA ACGGAATTTG TGCATTTAGT CCAAACCAGA GGTGAACCAG GTCAACCTTT GGCGTTATTA AATATGAAAC CAGGTGATCG CTCTTTAGTA GAAGTAGACT TTCTCTATCC TCCTGATGCT TCACCACCAC AAGTTTTAAC TGTGTCAGTT CAAGAGTAA
|
Protein sequence | MLPLLLGLTL AQANQSSPPP EEVVQPQEVR ALSGQLDTVP VFNSNSPELV LKEGILLSTF PPNGKKVPTA HLNFAFRGRF DIFAHHVAKA EPPESLRSLY LGIILHNPGT QPVKVNILQG ASYLSQPDAP FIKLDAFIPN NAGTVFAGPG SRVMSDVLGG RRQGIFPAQI VIPPGESHML LNLPIPVQGL TPPLNGRSSF MRLRSNGTVY AASLAMFAPT NKDGSERAPT LGEWQNLLNN GELSTPRDKV PTPLEETGKL RIYGRVAGVA SGSVWRSLLV DSPKTNYLTI PQPGQAFSYV LSTVDGGTLG TGQIQSASML VRYPDTAYRA HGNYGVQYSL KLPLYNNTQS HQKVSVLVQT PIKEDQLSQS GLRFFTRLAR QVFFRGTVRI RYKDNQGQPK TEFVHLVQTR GEPGQPLALL NMKPGDRSLV EVDFLYPPDA SPPQVLTVSV QE
|
| |