Gene Information |
Locus tag | Aazo_4403 |
Symbol | |
ID | 9342205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 4482743 |
End bp | 4484053 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003722841 |
Protein GI | 298492664 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
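The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 4482743
END_BP = 4484053
GENE_LENGTH_BP = 1311
PROTEIN_LENGTH_AA = 436


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks compare a derived value with the value listed in the table.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4484053 - 4482743 + 1 gives 1311 bp, and 1311 / 3 - 1 gives 436 aa, which agrees with the listed Gene Length and Protein Length.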
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0196704 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCCA TCAGTGTTGA AATTCTCATT ATCTTGGTGC TAATCTTTGC CAACGGTGTA TTTTCGATGT CCGAGATGGC GATAGTCTCC GCACGGAAGG TGCGATTACA GCAATTAGCT AATCAAGGCA ACCTCAATGC CAGGGCTGCA TTAGAACTAG CCGAATCTCC CAATCATTTT CTGTCCATTG TCCAGGTTGG AATTACACTG ATCAATATTC TCAATGGTGT ATTTGGTGGT GCTACCATTG CCCAAAGGCT AGAAGGCTAT GTGAAGCTAG TTCCATTCTT AGCTGGTTAT AGCCAACCCA TAGCTTTTAG TTTAGTTGTA TTACTAATCA CCTATTTTTC CCTAATTGTC GGTGAACTCG TACCTAAGCG GTTAGCATTA AACAACCCTG AACGAATTGC AGCAACTGTT GCTATTCCCA TGCGGGCTTT GGCTGCTTTA GCTTCCCCAG TGGTGTTTTT ATTAAGTGCT TCTACAGAAA CGGTGTTGCG AATTTTGGGA ATTACACCTT CAGATGAACC GCAAGTTACG GAAGAAGAAA TAAAAATTTT AATAGAACAA GGGACGGAAG CGGGAACTTT TGAGGAAGCA GAACAGGATA TGGTGGAAAG GGTATTTCGG TTAGGCGATC GCCCTGTAAC TGGTTTCATG ACACCCAGAC CGGATATAGT TTGCTTAGAC TTAGAAGATC CCGCAGAAGA AAACCGCCAA AAAATGGCCG ACAGCGCCTA TTCTCGATAT CCAGTTTGTC AAGCAGGACT AGATAACGTC CTGGGAATCA TCCCTGTTAC GGACTTATTA GCCAGAAGTT TACGCAATGA ACCCTTAGAC TTAACCCTAG AATTACGTCA GCCTGTATTT GTACCGGAAA GCACTCGTGG TTTAAAAGTT TTGGAGTTAT TCAAGCAAAC AGTAACTCAC ATGGCCTTAG TAGTCGATGA ATACGGCGTA ATTCAAGGCT TAGTAACTTT AAATGACATC ATGAGTGAAA TCGTCGGTGA TGTTCCCGCC GGACCCGGAC AGGAAGAACC ACAAGCTGTA CAACGGGAAG ATGGTTCTTG GTTAGTGGAT GGAATGTTAC CTGTAGAAGA GTTCTTGGAA CTTTTTGGTC TCGAAGAACT GGAAAATGAA GAAAGAGGAA ATTATCAAAC ATTAGGCGGT TTTATTATCA CCCATTTAGG GCGTATTCCC GCAGCCGCAG ATCATTTTGA ATGGGATGGT ATACGTTTTG AAGTTATGGA CATGGATGGC AACCGGGTAG ATAAGGTATT AATTATGCCA AGAGTACCTA ATAATAGGTA A
|
Protein sequence | MSSISVEILI ILVLIFANGV FSMSEMAIVS ARKVRLQQLA NQGNLNARAA LELAESPNHF LSIVQVGITL INILNGVFGG ATIAQRLEGY VKLVPFLAGY SQPIAFSLVV LLITYFSLIV GELVPKRLAL NNPERIAATV AIPMRALAAL ASPVVFLLSA STETVLRILG ITPSDEPQVT EEEIKILIEQ GTEAGTFEEA EQDMVERVFR LGDRPVTGFM TPRPDIVCLD LEDPAEENRQ KMADSAYSRY PVCQAGLDNV LGIIPVTDLL ARSLRNEPLD LTLELRQPVF VPESTRGLKV LELFKQTVTH MALVVDEYGV IQGLVTLNDI MSEIVGDVPA GPGQEEPQAV QREDGSWLVD GMLPVEEFLE LFGLEELENE ERGNYQTLGG FIITHLGRIP AAADHFEWDG IRFEVMDMDG NRVDKVLIMP RVPNNR
|
| |
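A minimal sketch of how the Sequence fields relate to the Translation table and GC content fields. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG), strips the spaces that group the bases into blocks of ten, and uses the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as the standard code. Alternative start codons permitted by table 11 are not handled, which does not matter for a sequence that starts with ATG. All function names are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (first base varies slowest, bases ordered TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGTCTTCCA TCAGTGTTGA AATTCTCATT"))  # MSSISVEILI
```

Passing the full Gene sequence value to translate should reproduce the Protein sequence value once the spaces are removed from both, and gc_fraction on the same input can be compared with the listed GC content.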