Gene Aazo_3645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3645 
Symbol 
ID9341450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3711327 
End bp3712682 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content38% 
IMG OID 
Productcysteine desulfurase family protein 
Protein accessionYP_003722336 
Protein GI298492159 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCTC TGGATATAAA ATGGATTCGC TCTCAATTTC CAGCTTTGAC GCAATCAATT 
AATGGTCATC CAGCTATTTT TTTTGATGGA CCTGGTGGAA CTCAAGTACC AGGTGCGGTA
TTGGATGGAA TGAGTAATTA TTTAGTCAGG TCTAATGCTA ATGCTCATGG GGATTTTGCT
ACCAGTGCGC GAACTGATGC GGTGATTAAT TCTGCAAGGG CTGCGAGCGC AGATTTTTTA
GGATGCGATA ATGATGAAGT GGTATTCGGT GCGAATATGA CCACTCTAAC CTTTAGTGTC
AGTCGTGCTA TTGGTCGAGA ACTGCAACCA GGTGATCAAA TAATTGTTAC CAAACTTGAT
CATGCAGCTA ATATTTCCCC TTGGTCTGCT TTAGAAGAAA AGGGTGTGAA TATTCAGGTT
GTGGACATCA ATGTTGCAGA CTGCACCCTC GATTTAAATG ATTTAGCAGC AAAGATTAAT
TCCCGCACAA AATTAGTAGC AGTGACTTAT GCTTCTAATG CTGTAGGAAC AATTAATGAT
ATTGCTAAAA TAGTTAAATT AGCTCATGCT GTTGGTGCTT TGGTTTTTGT TGATGCTGTT
CATTATTCTC CCCATGCACC TATGAATGTG CATCATTTAG ATTGTGATTT TCTAGTTTGT
TCCGCTTATA AATTCTTCGC TCCCCACGTT GGGATTTTAT ATGGAAAAAG AGAATATTTA
ACTCGTCTAA CTCCTTATAA AGTCAAACCT GGATCTAATG AAGTTCCATT TAAATGGGAA
ACCGGAACTT TGAACCATGA AGGTTTAGCG GGGTTAGTAG CCACAATTAA TTATTTAGCA
AAATTAGGTT GTCATGTTTC CCCAACTTTA GATAATGAAT TACTTGATTC TTTAATACAA
GCAGATAGAG AGGGTTTAAC TACTTTTCAT TGTCCCAGTT TTGTGACTGC ACCTGAACAA
CCTAGTCATG AGTTAGCTTC TGCTTATCAT AGTCGTCGTG CGGCTTTGTT AGCTGCAATG
TCAGCTATTC AAGAATATGA AAGAGAATTA AGTAAAAAGC TGATTTCTGG GTTGTTAGAA
ATTCCTGGTG TCACAGTTTA TGGTATTACT GAACCTAGCC AATTTATATG GAGAACTCCC
ACAGTTTCTA TCACAATTGA AGGCAAAAAC TCGGCAGATG TAGCCAAGTT TTTAGGAACC
AAAGGAATCT TTACTTGGCA TGGTCATTTC TATGCTATTG AACTCACAGA AAAGTTAGGG
GTAGAAACAT CTGGGGGTTT ATTGAGAATT GGATTAGCAC ACTATAATAA TGTAGAAGAA
ATTAATCAAT ATTTGTCGGT GTTAGTTGAG GTTTAA
 
Protein sequence
MESLDIKWIR SQFPALTQSI NGHPAIFFDG PGGTQVPGAV LDGMSNYLVR SNANAHGDFA 
TSARTDAVIN SARAASADFL GCDNDEVVFG ANMTTLTFSV SRAIGRELQP GDQIIVTKLD
HAANISPWSA LEEKGVNIQV VDINVADCTL DLNDLAAKIN SRTKLVAVTY ASNAVGTIND
IAKIVKLAHA VGALVFVDAV HYSPHAPMNV HHLDCDFLVC SAYKFFAPHV GILYGKREYL
TRLTPYKVKP GSNEVPFKWE TGTLNHEGLA GLVATINYLA KLGCHVSPTL DNELLDSLIQ
ADREGLTTFH CPSFVTAPEQ PSHELASAYH SRRAALLAAM SAIQEYEREL SKKLISGLLE
IPGVTVYGIT EPSQFIWRTP TVSITIEGKN SADVAKFLGT KGIFTWHGHF YAIELTEKLG
VETSGGLLRI GLAHYNNVEE INQYLSVLVE V