Gene Aazo_4113 details


Gene Information

Locus tag: Aazo_4113
Symbol:
ID: 9341918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4181207
End bp: 4182736
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 49%
IMG OID:
Product: photosystem II chlorophyll-binding protein CP47
Protein accession: YP_003722680
Protein GI: 298492503
COG category:
COG ID:
TIGRFAM ID:
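
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4181207
end_bp = 4182736

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.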


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGACTAC CTTGGTACCG AGTACATACA GTTGTTCTGA ACGATCCAGG TCGACTGATT 
TCTGTACACC TGATGCACAC AGCACTAGTC GCCGGCTGGG CTGGTTCAAT GGCATTATAC
GAACTAGCTG TATACGACCC CAGTGATCCA GTTCTCAACC CCATGTGGCG ACAAGGGATG
TTTGTTCTTC CTTTCATGTC ACGTTTGGGC GTAATCAAAT CCTGGGGCGG TTGGAGTGTT
ACTGGTGGCA CAGCAGTAGA TCCTGGCTTC TGGTCATTTG AAGGCGTTGC TGCTGCTCAC
ATTGTTCTTT CCGGTTTGTT ATTCCTAGCA GCCGTTTGGC ACTGGGTTTA CTGGGATTTG
GAACTCTTCA GAGATCCTCG TACCGGCGAA CCTGCCTTAG ACTTGCCAAA AATGTTTGGC
ATCCACTTAT TCTTATCTGG TTTACTTTGC TTCAGCTTCG GTGCTTTCCA CCTCACCGGA
CTATTTGGTC CTGGGATGTG GGTATCCGAT GCCTTTGGTG TCACTGGCAG CATCCAACCA
GTAGCACCAG AATGGGGACC AGCCGGGTTT AACCCCTATA ACCCTGGTGG CATTGTCGCT
CACCACATTG CAGCTGGTGT AGTTGGTATT ATCGCTGGTT TATTCCACCT CACAGTCAGA
CCCCCCGAAA GGCTCTACAA AGCACTACGG ATGGGTAACA TTGAAACCGT ACTTTCCAGC
AGTATTGCTG CGGTGTTCTT CGCTGCTTTC GTAGTAGCTG GTACTATGTG GTACGGTAAC
GCTGCTACCC CCATCGAATT GTTTGGACCT ACCCGTTACC AGTGGGATCA AGGCTACTTC
CGTCAAGAAA TTGAGCGCCG TGTGCAAACC AGTGTTGCTC AAGGCACAAG TCTAAGTGAA
GCTTGGTCAC AAATCCCCGA AAAATTGGCC TTCTACGATT ACGTAGGTAA TAGCCCCGCT
AAAGGTGGTC TATTCCGTAC AGGTCCAATG GTTAAGGGTG ATGGTATTGC CCAATCTTGG
CAAGGCCACG CAGTATTCAC AGATGCAGAA GGACGTGAGT TAACTGTACG TCGTCTGCCT
AACTTCTTTG AAACCTTCCC AGTAATTTTG ACCGATAAAG ATGGAATTGT CCGCGCTGAC
ATTCCTTTCC GTCGGGCAGA ATCTAAATAT AGCTTTGAGC AAACAGGCGT TACTGTTAGC
TTCTACGGCG GCAATCTCAA CGGCAATACC TTTACAGATC CTGCTGACGT GAAGAAATAC
GCTCGTAAAG CTCAAGGTGG AGAAATATTT GAATTTGACC GCGAAACCTT AAACTCTGAT
GGTGTATTCC GTACATCTCC CAGAGGTTGG TTTACCTTTG GTCACGCGGT ATTTGCCCTA
CTGTTCTTCT TTGGACACCT CTGGCATGGT TCTCGGACAA TCTACCGTGA CGTCTTTGCC
GGTGTAGAAG CGGATCTGGA AGAGCAAGTT GAGTGGGGTC TGTTCCAGAA AGTTGGTGAC
AAGACCACTC GTACGCGTAA AGAAGCTTAA
 
Protein sequence
MGLPWYRVHT VVLNDPGRLI SVHLMHTALV AGWAGSMALY ELAVYDPSDP VLNPMWRQGM 
FVLPFMSRLG VIKSWGGWSV TGGTAVDPGF WSFEGVAAAH IVLSGLLFLA AVWHWVYWDL
ELFRDPRTGE PALDLPKMFG IHLFLSGLLC FSFGAFHLTG LFGPGMWVSD AFGVTGSIQP
VAPEWGPAGF NPYNPGGIVA HHIAAGVVGI IAGLFHLTVR PPERLYKALR MGNIETVLSS
SIAAVFFAAF VVAGTMWYGN AATPIELFGP TRYQWDQGYF RQEIERRVQT SVAQGTSLSE
AWSQIPEKLA FYDYVGNSPA KGGLFRTGPM VKGDGIAQSW QGHAVFTDAE GRELTVRRLP
NFFETFPVIL TDKDGIVRAD IPFRRAESKY SFEQTGVTVS FYGGNLNGNT FTDPADVKKY
ARKAQGGEIF EFDRETLNSD GVFRTSPRGW FTFGHAVFAL LFFFGHLWHG SRTIYRDVFA
GVEADLEEQV EWGLFQKVGD KTTRTRKEA
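
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the tool used to produce this record. It assumes the sense codon assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons), and the helper names `translate` and `gc_content` are hypothetical. Whitespace is stripped so the 10-base blocks shown above can be pasted directly.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGGACTAC CTTGGTACCG AGTACATACA"))
```

Applied to the first 30 bases of the gene sequence, this yields the first ten residues of the listed protein sequence; applying `gc_content` to the full gene sequence gives a value that can be compared with the GC content field.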