Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_3645 |
Symbol | |
ID | 9341450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 3711327 |
End bp | 3712682 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | |
Product | cysteine desulfurase family protein |
Protein accession | YP_003722336 |
Protein GI | 298492159 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCTC TGGATATAAA ATGGATTCGC TCTCAATTTC CAGCTTTGAC GCAATCAATT AATGGTCATC CAGCTATTTT TTTTGATGGA CCTGGTGGAA CTCAAGTACC AGGTGCGGTA TTGGATGGAA TGAGTAATTA TTTAGTCAGG TCTAATGCTA ATGCTCATGG GGATTTTGCT ACCAGTGCGC GAACTGATGC GGTGATTAAT TCTGCAAGGG CTGCGAGCGC AGATTTTTTA GGATGCGATA ATGATGAAGT GGTATTCGGT GCGAATATGA CCACTCTAAC CTTTAGTGTC AGTCGTGCTA TTGGTCGAGA ACTGCAACCA GGTGATCAAA TAATTGTTAC CAAACTTGAT CATGCAGCTA ATATTTCCCC TTGGTCTGCT TTAGAAGAAA AGGGTGTGAA TATTCAGGTT GTGGACATCA ATGTTGCAGA CTGCACCCTC GATTTAAATG ATTTAGCAGC AAAGATTAAT TCCCGCACAA AATTAGTAGC AGTGACTTAT GCTTCTAATG CTGTAGGAAC AATTAATGAT ATTGCTAAAA TAGTTAAATT AGCTCATGCT GTTGGTGCTT TGGTTTTTGT TGATGCTGTT CATTATTCTC CCCATGCACC TATGAATGTG CATCATTTAG ATTGTGATTT TCTAGTTTGT TCCGCTTATA AATTCTTCGC TCCCCACGTT GGGATTTTAT ATGGAAAAAG AGAATATTTA ACTCGTCTAA CTCCTTATAA AGTCAAACCT GGATCTAATG AAGTTCCATT TAAATGGGAA ACCGGAACTT TGAACCATGA AGGTTTAGCG GGGTTAGTAG CCACAATTAA TTATTTAGCA AAATTAGGTT GTCATGTTTC CCCAACTTTA GATAATGAAT TACTTGATTC TTTAATACAA GCAGATAGAG AGGGTTTAAC TACTTTTCAT TGTCCCAGTT TTGTGACTGC ACCTGAACAA CCTAGTCATG AGTTAGCTTC TGCTTATCAT AGTCGTCGTG CGGCTTTGTT AGCTGCAATG TCAGCTATTC AAGAATATGA AAGAGAATTA AGTAAAAAGC TGATTTCTGG GTTGTTAGAA ATTCCTGGTG TCACAGTTTA TGGTATTACT GAACCTAGCC AATTTATATG GAGAACTCCC ACAGTTTCTA TCACAATTGA AGGCAAAAAC TCGGCAGATG TAGCCAAGTT TTTAGGAACC AAAGGAATCT TTACTTGGCA TGGTCATTTC TATGCTATTG AACTCACAGA AAAGTTAGGG GTAGAAACAT CTGGGGGTTT ATTGAGAATT GGATTAGCAC ACTATAATAA TGTAGAAGAA ATTAATCAAT ATTTGTCGGT GTTAGTTGAG GTTTAA
|
Protein sequence | MESLDIKWIR SQFPALTQSI NGHPAIFFDG PGGTQVPGAV LDGMSNYLVR SNANAHGDFA TSARTDAVIN SARAASADFL GCDNDEVVFG ANMTTLTFSV SRAIGRELQP GDQIIVTKLD HAANISPWSA LEEKGVNIQV VDINVADCTL DLNDLAAKIN SRTKLVAVTY ASNAVGTIND IAKIVKLAHA VGALVFVDAV HYSPHAPMNV HHLDCDFLVC SAYKFFAPHV GILYGKREYL TRLTPYKVKP GSNEVPFKWE TGTLNHEGLA GLVATINYLA KLGCHVSPTL DNELLDSLIQ ADREGLTTFH CPSFVTAPEQ PSHELASAYH SRRAALLAAM SAIQEYEREL SKKLISGLLE IPGVTVYGIT EPSQFIWRTP TVSITIEGKN SADVAKFLGT KGIFTWHGHF YAIELTEKLG VETSGGLLRI GLAHYNNVEE INQYLSVLVE V
|
| |