Gene Information |
Locus tag | Aazo_4811 |
Symbol | |
ID | 9342618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 4920688 |
End bp | 4921884 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | |
Product | cysteine desulfurase |
Protein accession | YP_003723102 |
Protein GI | 298492925 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAATAT ATCTAGATTA CAGTGCTACT ACTCCTACTC GACCCGAAGC GATCGCTACA ATGCAAGCAG TCTTAAATCA ACAGTGGGGT AATCCTTCCA GTTTACATGA GTGGGGCAAC CGTGCAGCAT TAGTTGTGGA ACAAGCAAGA ATACAAGTTG CAGGTTTAAT TAATGCTGTT CCCGAATCAA TTATCTTTAC TTCTGGGGGA ACAGAAGCAG ATAATTTAGC AGTTATGGGT GTGGCTCGAT GTTATCCTGT ACCACAACAT ATCATTATTT CTAGTGTGGA ACATTCGGCT GTTTCTGAAC CGGTGCGAAT GTTAGAAAAT TGGGGTTGGG AAGTTACCCG TTTAGGTGTA GATGGTAAAG GTAGAATTAA TCCCGAAGAT TTAAAAGCAG CTTTGCAACA TAACACTGTT TTGGTATCAG TGATTTATGG ACAAAGTGAA GTGGGAACTG TTCAACCGAT AGCAGAACTA GGAAGAATTA CCAAAATCCA TGGTGCTTTA TTCCATACAG ATGCGGTGCA AGTTGCGGGA CGTTTAGCGA TAGATGTCAA TAATTTAGGT ATTGATTTAT TGAGTTTATC TAGTCATAAA ATATATGGTC CCTTGGGTGC AGGTGCTTTA TATGTGCGTC CAGGCATGAA CTTAATACCA TTGTTAGGTG GTGGTGGACA AGAACAAGGA CTGCGTTCAG GTACACAAGC AACACCTGCT ATTGCTGGGT TTGGAGTAGC TGCGGAGTTA GCGGGACAGG AGTTAGAAAC AGAAAGACTA AGATTAACAG AATTGCGTGA TCGCCTCTTT ACCAAATTAG CAGATATTCC CAGTTTAATT CCCACAGGTG ACAGAATTCA CCGCTTACCC CATCATCTTA GCTTTTCTTT AGAATATGCC GATGGCGAAA AAATTAGTGG TAAAACCCTA GTCCGTCAAT TAAACTTAGC AGGAATCGGC ATTAGTGCAG GTGCTGCTTG TAATAGTGGA AAATTAAGTC CCAGTCCGAT TTTATTAGCA ATGGGGTATT CACAAATAGC CGCTTTGGGC GGAATTAGGT TAACTTTAGG AAAACAAACA ACAGCAGCAG ATGTTGATTG GACAGCAATA GTTTTGAAAC AAGTTCTACA GCGATTGACA GCAGATTTAT CCTTAGTGAT ACAATCCACC TCAATCACTT GCCAATTAGC AATTTGA
|
Protein sequence | MQIYLDYSAT TPTRPEAIAT MQAVLNQQWG NPSSLHEWGN RAALVVEQAR IQVAGLINAV PESIIFTSGG TEADNLAVMG VARCYPVPQH IIISSVEHSA VSEPVRMLEN WGWEVTRLGV DGKGRINPED LKAALQHNTV LVSVIYGQSE VGTVQPIAEL GRITKIHGAL FHTDAVQVAG RLAIDVNNLG IDLLSLSSHK IYGPLGAGAL YVRPGMNLIP LLGGGGQEQG LRSGTQATPA IAGFGVAAEL AGQELETERL RLTELRDRLF TKLADIPSLI PTGDRIHRLP HHLSFSLEYA DGEKISGKTL VRQLNLAGIG ISAGAACNSG KLSPSPILLA MGYSQIAALG GIRLTLGKQT TAADVDWTAI VLKQVLQRLT ADLSLVIQST SITCQLAI
|
| |
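The length fields in the record can be cross-checked against the coordinates. The sketch below is a minimal consistency check, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon (the gene sequence shown ends in TGA). The function name `expected_lengths` is hypothetical.

```python
def expected_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    return gene_len, codons - 1  # drop the stop codon


# Coordinates copied from the Gene Information table above.
gene_len, protein_len = expected_lengths(4920688, 4921884)
print(gene_len, protein_len)  # 1197 398
```

Both values agree with the Gene Length (1197 bp) and Protein Length (398 aa) fields listed in the record.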