Gene Aazo_2153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2153 
Symbol 
ID9339952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2235547 
End bp2236977 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content47% 
IMG OID 
Productribulose-bisphosphate carboxylase 
Protein accessionYP_003721290 
Protein GI298491113 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTACG CGCAAACTAA GACTCAGGCT AAATCTGGGT ATCAAGCTGG GGTTAAAGAT 
TACAGACTAA CTTATTACAC TCCCGATTAC ACACCTAAAG ATACAGATAT TCTTGCAGCA
TTCCGGATGA CTCCTCAGCC TGGAGTTCCA CCCGAAGAAG CTGGTGCTGC GGTAGCAGCT
GAGTCTTCTA CAGGTACTTG GACAACTGTA TGGACAGACT TGCTCACCGA CCTAGACCGT
TACAAAGGTC GTTGCTACGA TATCGAACCA GTTCCTGGTG AAGATAACCA ATACATTGCT
TACGTTGCTT ATCCTCTGGA CTTGTTTGAG GAAGGTTCTG TAACCAATAT GTTGACTTCT
ATCGTAGGTA ACGTATTCGG TTTCAAAGCA TTACGTGCAT TACGTTTAGA AGACTTGCGG
ATTCCAGTTG CTTACTTGAA GACTTTCCAA GGTCCTCCTC ATGGTATTCA AGTTGAACGC
GACAAGTTGA ACAAGTATGG TCGTCCTTTG TTGGGTTGTA CGATCAAGCC CAAATTGGGT
TTGTCTGCGA AGAACTATGG ACGCGCTGTA TACGAATGTT TGCGCGGTGG TTTGGACTTC
ACCAAAGACG ACGAAAACAT TAACTCTGCA CCATTCCAAA GATGGCGTGA TCGCTTCCTA
TTCGTAGCAG AAGCAATCCA CAAAGCACAA GCAGAAACCG GTGAAATCAA GGGCCACTAC
CTAAACGTAA CCGCACCTAC CTGCGAACAA ATGCTGCAAC GGGCTGAGTA CGCCAAAGAA
CTCAATATGC CTATCATCAT GCATGACTAC CTGACCGCAG GTTTCACAGC TAACACAACC
TTGGCTCACT GGTGTCGTAA CAATGGTGTA TTACTACACA TCCACCGTGC TATGCATGCT
GTTATTGACC GTCAAAAGAA CCACGGTATT CACTTCCGTG TATTAGCTAA GACGTTACGT
ATGTCTGGTG GAGACCACAT TCACACCGGT ACAGTTGTTG GTAAGTTGGA AGGTGAACGC
GGCATCACAA TGGGCTTCGT TGACCTACTA CGTGAAAACT ACGTTGAACA AGACAAGTCT
CGTGGTATCT ACTTCACCCA AGATTGGGCT TCTATGCCTG GTGTAATGGC AGTTGCTTCC
GGTGGTATCC ACGTATGGCA CATGCCCGCA CTCGTAGAAA TCTTTGGTGA TGACTCCGTA
CTACAGTTTG GTGGTGGGAC ACTTGGTCAC CCATGGGGTA ACGCTCCTGG TGCAACCGCT
AACCGTGTAG CCCTAGAAGC TTGTATCCAA GCTCGTAACG AAGGACGTAA CTTAGCTCGT
GAAGGTAACG ATATTATCCG CGAAGCTGCT AAGTGGTCTC CTGAACTGGC CGTTGCTTGC
GAACTGTGGA AAGAAATCAA GTTCGAGTTT GAAGCAATGG ATACCGTCTG A
 
Protein sequence
MSYAQTKTQA KSGYQAGVKD YRLTYYTPDY TPKDTDILAA FRMTPQPGVP PEEAGAAVAA 
ESSTGTWTTV WTDLLTDLDR YKGRCYDIEP VPGEDNQYIA YVAYPLDLFE EGSVTNMLTS
IVGNVFGFKA LRALRLEDLR IPVAYLKTFQ GPPHGIQVER DKLNKYGRPL LGCTIKPKLG
LSAKNYGRAV YECLRGGLDF TKDDENINSA PFQRWRDRFL FVAEAIHKAQ AETGEIKGHY
LNVTAPTCEQ MLQRAEYAKE LNMPIIMHDY LTAGFTANTT LAHWCRNNGV LLHIHRAMHA
VIDRQKNHGI HFRVLAKTLR MSGGDHIHTG TVVGKLEGER GITMGFVDLL RENYVEQDKS
RGIYFTQDWA SMPGVMAVAS GGIHVWHMPA LVEIFGDDSV LQFGGGTLGH PWGNAPGATA
NRVALEACIQ ARNEGRNLAR EGNDIIREAA KWSPELAVAC ELWKEIKFEF EAMDTV