Gene Information |
Locus tag | Aazo_1849 |
Symbol | |
ID | 9339642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 1921886 |
End bp | 1922983 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003721074 |
Protein GI | 298490897 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
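The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both assumptions match the figures in this record but are not stated by it. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(1921886, 1922983)
print(length)                  # 1098
print(protein_length(length))  # 365
```

Strand does not affect these counts: a gene on the minus strand spans the same number of bases between its start and end coordinates.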
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.326919 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCGTT CAGCCTCTCC TCAAAGTTTG CTAATTTATG GTCTGAGTGT CCCGATTATC GCTCTCAATG TCTGGCTACT ATCGGTACTG TTTCGTTATT TCCAGCACCC CATCACTATC CTAAGTATTG CGGCTATTTT GGCATTTTTA CTCAACTACC CAGTTAAATT TTTAGAAAAA GCTAGGATTA CTCGTACTCA GGCAGTGATA ATAGTTTTAA TCATCACTTT GGCTTTGTTA GGAATTCTAT GTGTTACCCT TGTACCAATG GTAATTGAGC AAACAATCCA ACTTTTAAAT AAGATTCCTG ATTGGTTAGC TTCCAGTCAA GACAATCTGG GTAAATTGCA GGTAGTAGCG CGTCAAAGAA GAATAAATAT TGACTTTAGT CTAGTAACTA ATCAAATCAA TGCCAATATT CAAAATTTGG TCCAACAGAT AGCTTCCAGT GCGGTGGGAT TTGCCGGAAC CCTGTTATCA GGATTACTGA ACTTAGTGTT AGTAGTCGTA TTAGCCTTTT ATATGCTGTT ATATGGCGAT CACGTATGGT ATGGTCTAAT CAATCTCCTC CCGTCTAATA TTGGCATCCC CTTCAACAAG TCATTACAGT TAAATTTCCA GAACTTTTTC CTCAGTCAAC TATTGCTGGG ACTTTTCATG GAACTAGTCC TTACTCCCAT TTTCCTATTT TTGAGAGTAC CATTTGCCCT GTTATTTGCC ATTGTTATTG GTCTTTCGGA ACTGATTCCC TTTGTCGGAG CAACTTTAGG CATTAGTTTA GTCACAATTT TGGTATTACT ACAAAATTGG TGGTTAGCAT TTCCAGTAGC AACGGTGGCA ATTGTCCTAC AACAAATCAA GGATAATCTC TTAGCACCTA AGTTACTAGG TAACTTTATT GGACTCAACC CAATATGGAT TTTTGTCTCT ATTTTGATGG GATTTGAAAT TGCGGGTTTA TTGGGAACAC TAGTTGCTGT ACCAATTGCT GGCACTATCA AAGGTACATT TGACGCTATT AAAAGTAGTA AACATAATGA ATATGTATCA AACTTTACCG TCACTTATGA ATCAAAATCT GGTGAAAATG ATAAATAG
|
Protein sequence | MRRSASPQSL LIYGLSVPII ALNVWLLSVL FRYFQHPITI LSIAAILAFL LNYPVKFLEK ARITRTQAVI IVLIITLALL GILCVTLVPM VIEQTIQLLN KIPDWLASSQ DNLGKLQVVA RQRRINIDFS LVTNQINANI QNLVQQIASS AVGFAGTLLS GLLNLVLVVV LAFYMLLYGD HVWYGLINLL PSNIGIPFNK SLQLNFQNFF LSQLLLGLFM ELVLTPIFLF LRVPFALLFA IVIGLSELIP FVGATLGISL VTILVLLQNW WLAFPVATVA IVLQQIKDNL LAPKLLGNFI GLNPIWIFVS ILMGFEIAGL LGTLVAVPIA GTIKGTFDAI KSSKHNEYVS NFTVTYESKS GENDK
|
| |
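The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be stripped before any length or composition check. The following is a minimal sketch of that clean-up together with a GC-fraction calculation. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence from this record.
prefix = clean_sequence("ATGCGCCGTT CAGCCTCTCC")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.65
```

Applied to the full gene sequence, the same two calls could be compared against the Gene Length and GC content fields reported in the record.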