Gene Aazo_4811 details


Gene Information

Locus tag: Aazo_4811
Symbol:
ID: 9342618
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4920688
End bp: 4921884
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 41%
IMG OID:
Product: cysteine desulfurase
Protein accession: YP_003723102
Protein GI: 298492925
COG category:
COG ID:
TIGRFAM ID:
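
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 4920688
end_bp = 4921884

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1197
print(protein_length_aa)  # 398
```

Both results match the listed Gene Length (1197 bp) and Protein Length (398 aa).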


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAATAT ATCTAGATTA CAGTGCTACT ACTCCTACTC GACCCGAAGC GATCGCTACA 
ATGCAAGCAG TCTTAAATCA ACAGTGGGGT AATCCTTCCA GTTTACATGA GTGGGGCAAC
CGTGCAGCAT TAGTTGTGGA ACAAGCAAGA ATACAAGTTG CAGGTTTAAT TAATGCTGTT
CCCGAATCAA TTATCTTTAC TTCTGGGGGA ACAGAAGCAG ATAATTTAGC AGTTATGGGT
GTGGCTCGAT GTTATCCTGT ACCACAACAT ATCATTATTT CTAGTGTGGA ACATTCGGCT
GTTTCTGAAC CGGTGCGAAT GTTAGAAAAT TGGGGTTGGG AAGTTACCCG TTTAGGTGTA
GATGGTAAAG GTAGAATTAA TCCCGAAGAT TTAAAAGCAG CTTTGCAACA TAACACTGTT
TTGGTATCAG TGATTTATGG ACAAAGTGAA GTGGGAACTG TTCAACCGAT AGCAGAACTA
GGAAGAATTA CCAAAATCCA TGGTGCTTTA TTCCATACAG ATGCGGTGCA AGTTGCGGGA
CGTTTAGCGA TAGATGTCAA TAATTTAGGT ATTGATTTAT TGAGTTTATC TAGTCATAAA
ATATATGGTC CCTTGGGTGC AGGTGCTTTA TATGTGCGTC CAGGCATGAA CTTAATACCA
TTGTTAGGTG GTGGTGGACA AGAACAAGGA CTGCGTTCAG GTACACAAGC AACACCTGCT
ATTGCTGGGT TTGGAGTAGC TGCGGAGTTA GCGGGACAGG AGTTAGAAAC AGAAAGACTA
AGATTAACAG AATTGCGTGA TCGCCTCTTT ACCAAATTAG CAGATATTCC CAGTTTAATT
CCCACAGGTG ACAGAATTCA CCGCTTACCC CATCATCTTA GCTTTTCTTT AGAATATGCC
GATGGCGAAA AAATTAGTGG TAAAACCCTA GTCCGTCAAT TAAACTTAGC AGGAATCGGC
ATTAGTGCAG GTGCTGCTTG TAATAGTGGA AAATTAAGTC CCAGTCCGAT TTTATTAGCA
ATGGGGTATT CACAAATAGC CGCTTTGGGC GGAATTAGGT TAACTTTAGG AAAACAAACA
ACAGCAGCAG ATGTTGATTG GACAGCAATA GTTTTGAAAC AAGTTCTACA GCGATTGACA
GCAGATTTAT CCTTAGTGAT ACAATCCACC TCAATCACTT GCCAATTAGC AATTTGA
 
Protein sequence
MQIYLDYSAT TPTRPEAIAT MQAVLNQQWG NPSSLHEWGN RAALVVEQAR IQVAGLINAV 
PESIIFTSGG TEADNLAVMG VARCYPVPQH IIISSVEHSA VSEPVRMLEN WGWEVTRLGV
DGKGRINPED LKAALQHNTV LVSVIYGQSE VGTVQPIAEL GRITKIHGAL FHTDAVQVAG
RLAIDVNNLG IDLLSLSSHK IYGPLGAGAL YVRPGMNLIP LLGGGGQEQG LRSGTQATPA
IAGFGVAAEL AGQELETERL RLTELRDRLF TKLADIPSLI PTGDRIHRLP HHLSFSLEYA
DGEKISGKTL VRQLNLAGIG ISAGAACNSG KLSPSPILLA MGYSQIAALG GIRLTLGKQT
TAADVDWTAI VLKQVLQRLT ADLSLVIQST SITCQLAI
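
The protein sequence can be reproduced from the gene sequence using translation table 11, the table named in the record. The following is a minimal sketch rather than the database's own method: the `translate` helper and the constant names are hypothetical, and only the first and last rows of the gene sequence are translated to keep the example short. The last row starts in frame because the 19 rows before it hold 1140 bases, a multiple of three.

```python
from itertools import product

# Amino acids of NCBI translation table 11, with codons ordered T, C, A, G
# at each of the three positions ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGCAAATAT ATCTAGATTA CAGTGCTACT ACTCCTACTC GACCCGAAGC GATCGCTACA"
last_row = "GCAGATTTAT CCTTAGTGAT ACAATCCACC TCAATCACTT GCCAATTAGC AATTTGA"

print(translate(first_row))  # MQIYLDYSATTPTRPEAIAT
print(translate(last_row))   # ADLSLVIQSTSITCQLAI
```

The two outputs match the first 20 and last 18 residues of the protein sequence listed above, and the final codon of the gene (TGA) is read as a stop.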