Gene Aazo_1311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1311 
Symbol 
ID9339106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1388506 
End bp1390017 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content45% 
IMG OID 
Productthreonine dehydratase, biosynthetic 
Protein accessionYP_003720707 
Protein GI298490530 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.484078 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATTGCG ACTACCTGGT ACAAATCCTC ACTGCCCGCG TCTATGATGT AGCCCAGGAA 
ACACCACTGG AATATGCCCC CAACCTCTCG AACCGCATCA ATAACAAACT GCTGCTGAAG
CGGGAAGATA TGCAGTCTGT CTTCTCCTTC AAGCTGCGGG GTGCATATAA CAAAATGGTC
AACTTACCAC CAGATTTACT GAAACAAGGT GTGATTGCTG CTTCTGCTGG AAATCATGCC
CAAGGTGTGG CCTTGGCTGC CGGTCGTTTG GGAACAAAAG CAATCATCGT TATGCCGATG
ATTACGCCCC AGGTAAAAAT CAATGCAGTC AAAACACGGG GAGGAGAAGT TGTACTACAT
GGCTATAACT ATGATGATGC TTACGCCCAC GCCCGGCAAT TGGAAGCAGA AAAAGGGTTA
ACCTTTATAC ATCCCTTTGA TGATCCTGAC GTTATTGCTG GCCAGGGAAC AATCGGCATG
GAAATACTGC GACAATATCA GCAACCTATC CATGCAATAT TTGTGGCTAT TGGAGGTGGA
GGTTTAATTT CCGGGATTGC GGCTTATGTG AAACGGTTGC GTCCCGAAAT CAAAATTATC
GGTGTCGAAC CTGTCGATGC TGATGCTATG TCTCAATCTA TTCAAGCGGG TCATCGGGTG
CGGTTGTCTC AGGTGGGTTT ATTTGCTGAT GGGGTAGCGG TGCGGGAAGT GGGGGAAGAA
ACCTTCCGGT TGTGTCAGCA ATATGTGGAT GAAATTATTT TGGTGGATAC GGATGCTACC
TGTGCAGCGA TAAAAGATGT GTTTGAAGAT ACCCGTTCTA TTTTAGAACC TGCTGGAGCA
TTAGCGATCG CAGCTGCTAA AGCCTACGCG GAACGAGAAC AAATCCAAGA ACAAACCCTG
ATCGGCGTGG CTTGCGGTGC TAACATGAAC TTTGAGCGCT TGCGTTTCGT GGCAGAACGG
GCAGAATTTG GGGAACGTCG AGAAGCCATT TTTGCTGTGA ATATTCCCGA AAAACTAGGC
AGTTTGCGCC AGTTTTGTGA ATGTTTAGGC GAGCGTAATT TAACAGAATT TAACTATCGC
ATTGCTGACA ATAAAGAAGC CCATATATTT GTCGGAGTGC AAATTGAAAA CCGCGCCGAC
GCAGCTAAAA TAGTTGATAA ATTTGAAGCT AGTGGTTTAA AAACAATTGA TTTAACCGAT
GATGAATTAA CAAAATTGCA CCTGCGGCAC ATGGTCGGTG GACATTCTAC CCTTGCTCAA
AATGAATTAT TATATCGGTT TGAATTTGTT GAACGTCCTG GTGCATTAAT GAAATTTGTC
GGTTCTATGT CTCCCGACTG GAATATTAGC TTATTTCACT ATCGCAACAA CGGCGCAGAC
TATGGGCGGA TTGTAGTCGG AATGCAAGTA CCACCCCATG AAATGGAAGA ATGGCAAATA
TTTCTTGATT CCTTGGGTTA TCACTATTGG GATGAAAGCC AAAATACCGC CTATAAGTTG
TTTTTAGGTT AA
 
Protein sequence
MYCDYLVQIL TARVYDVAQE TPLEYAPNLS NRINNKLLLK REDMQSVFSF KLRGAYNKMV 
NLPPDLLKQG VIAASAGNHA QGVALAAGRL GTKAIIVMPM ITPQVKINAV KTRGGEVVLH
GYNYDDAYAH ARQLEAEKGL TFIHPFDDPD VIAGQGTIGM EILRQYQQPI HAIFVAIGGG
GLISGIAAYV KRLRPEIKII GVEPVDADAM SQSIQAGHRV RLSQVGLFAD GVAVREVGEE
TFRLCQQYVD EIILVDTDAT CAAIKDVFED TRSILEPAGA LAIAAAKAYA EREQIQEQTL
IGVACGANMN FERLRFVAER AEFGERREAI FAVNIPEKLG SLRQFCECLG ERNLTEFNYR
IADNKEAHIF VGVQIENRAD AAKIVDKFEA SGLKTIDLTD DELTKLHLRH MVGGHSTLAQ
NELLYRFEFV ERPGALMKFV GSMSPDWNIS LFHYRNNGAD YGRIVVGMQV PPHEMEEWQI
FLDSLGYHYW DESQNTAYKL FLG