Gene Aazo_0404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0404 
Symbol 
ID9338188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp405667 
End bp406803 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content44% 
IMG OID 
Product2-methylcitrate synthase/citrate synthase II 
Protein accessionYP_003720084 
Protein GI298489907 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTGT GCGAATACAA GCCTGGTTTA GAAGGCATTC CCGCAGCCCA ATCGAGTATT 
AGTTATGTAG ATGGGCAAAA GGGAATACTG GAGTATCGTG GCATTCGGAT TGAGGAATTA
GCAGAAAGAA GTACATTCTT AGAAACTGCT TATCTCTTAA TCTGGGGTGA ATTGCCAAAC
AAGGAAGAAT TGGCGGCTTT TGAAGATGAA GTTTGTACCC ACAGGCGGAT AAAATACCGC
ATTCGGGATA TGATGAAATG CTTCCCCGAA AGCGGTCATC CAATGGATGC TCTACAAGCC
TCTGCTGCTG CCTTGGGTTT ATTTTATTCT CGTCGTGACT TACATAACCC TGTCTATATT
CGGAATGCAG TAGTACGTTT AGTAGCGATA ATTCCCACAA TGGTAGCGGC TTTCCAGTTG
ATGCGGAAGG GTAATGATCC GGTAAAACCA AGGGATGATT TGGATTATGC CGCCAACTTC
TTGTATATGC TCAATGAGAA AGAACCCGAT CCGTTAGCAG CGCGAATTTT TGACATCTGT
TTGATTCTGC ACGTTGAGCA TACAATGAAT GCTTCTACCT TCAGTGCGAG AGTTACAGCT
TCTACCTTAA CTGACCCTTA TGCTGTGGTT GCCAGTGCGG TAGGTACTTT AGGCGGTCCG
CTGCATGGTG GTGCTAACGA AGAAGTTATT GAGATGTTGG AAGAAATTAG CTGTGTGGAT
AATGTCCGTT CCTACATAGA AGATCGTCTG CAAAAGAAGG CGAAAATTAT GGGGTTTGGA
CACCGTGTAT ATAAGGTAAA AGATCCACGG GCAACTATCT TACAACGATT GGCAGAGCAA
CTGTTCGATA AGTTTGGCTA CGATAAGTAT TATGAAGTTG CTCAAGAAGT AGAATGGGTA
ATGGCCGAGA AAGTCGGCAG CAAAGGGATT TATCCTAATG TTGACTTTTA CTCTGGGCTC
GTGTATAGGA AAATGGGAAT TCCCACGGAT TTATTTACAC CTGTATTTGC GATCGCTCGT
GTGGCTGGTT GGTTAGCACA CTGGAAAGAA CAACTTGCAG AAAACCGGAT TTTCCGTCCT
ACCCAAGTTT ATAACGGTCG TCACGAAATC ACTTACACTC CCATCGACAA GCGTTAA
 
Protein sequence
MTVCEYKPGL EGIPAAQSSI SYVDGQKGIL EYRGIRIEEL AERSTFLETA YLLIWGELPN 
KEELAAFEDE VCTHRRIKYR IRDMMKCFPE SGHPMDALQA SAAALGLFYS RRDLHNPVYI
RNAVVRLVAI IPTMVAAFQL MRKGNDPVKP RDDLDYAANF LYMLNEKEPD PLAARIFDIC
LILHVEHTMN ASTFSARVTA STLTDPYAVV ASAVGTLGGP LHGGANEEVI EMLEEISCVD
NVRSYIEDRL QKKAKIMGFG HRVYKVKDPR ATILQRLAEQ LFDKFGYDKY YEVAQEVEWV
MAEKVGSKGI YPNVDFYSGL VYRKMGIPTD LFTPVFAIAR VAGWLAHWKE QLAENRIFRP
TQVYNGRHEI TYTPIDKR