Gene Aazo_1060 details

Gene Information

Locus tag: Aazo_1060
Symbol:
ID: 9338856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 1132942
End bp: 1134327
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 47%
IMG OID:
Product: photosystem II 44 kDa subunit reaction center protein
Protein accession: YP_003720540
Protein GI: 298490363
COG category:
COG ID:
TIGRFAM ID:
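
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and a gene length that includes the stop codon.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the length fields of a CDS record.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the gene length counts the stop codon.
    """
    span = end_bp - start_bp + 1              # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "length_is_multiple_of_three": remainder == 0,
        # One codon is the stop codon, which adds no residue.
        "protein_matches_codons_minus_stop": codons - 1 == protein_length_aa,
    }


# Values taken from the record above.
checks = check_cds_lengths(1132942, 1134327, 1386, 461)
for name, ok in checks.items():
    print(f"{name}: {ok}")
```

Under those assumptions, 1134327 - 1132942 + 1 = 1386 bp, and 1386 / 3 = 462 codons, that is 461 residues plus one stop codon.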


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000366993
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTAACGC TCTCTAATAG ATCAGTTATA GGAGGCGGAC GTGATCAAGA ATCAAGCGGC 
TTTGCTTGGT GGTCTGGAAA CGCTCGTTTA ATTAACCTAT CTGGTAAACT GCTTGGCGCT
CACGTTGCCC ATGCTGGTTT AATCGTCTTC TGGGCTGGAG CAATGACTTT ATTTGAAGTT
GCTCACTTTA TCCCAGAAAA GCCCATGTAC GAACAGGGCT TGATTCTTCT GCCTCACATT
GCTACATTAG GTTGGGGCGT TGGTGCTGGT GGTGAAGTAA TTGACACCTT CCCCTACTTT
GTTGCTGGTG TACTGCACCT GATTTCCTCT GCTGTACTGG GTTTTGGTGG CATCTATCAT
GCCGTTCGTG GCCCAGAAAC ATTAGAAGAA TATTCTTCCT TTTTCGGTTA CGACTGGAAA
GACAAGAACA AAATGACCAA CATCATCGGC TTCCACCTTA TCATCTTGGG ATGTGGTGCG
CTGCTGTTGG TGTTAAAGGC TATGTTTTTC GGCGGTGTCT ATGATACCTG GGCACCCGGT
GGTGGTGATG TGCGTGTTAT TACTAACCCC ACACTCAATC CAGCGATCAT CTTTGGTTAT
CTAATTAAAG CTCCCTTCGG TGGCGAAGGC TGGATTGTTA GCGTTGATAA CATGGAAGAT
GTTATCGGTG GTCACATTTG GATTGCTTTA ATTTGTATTT CCGGTGGTAT TTGGCACATC
TTCACTAAGC CTTTTGCTTG GGCGCGTCGC GCTTTCATCT GGTCTGGTGA AGCTTACCTT
TCCTACAGCT TGGGCGCTCT TTCCTTGATG GGCTTTATCG CTTCCATCAT GGTTTGGTAC
AACAACACTG TTTACCCCAG CGAATTCTTC GGTCCTACTG GTCCTGAAGC TTCTCAAGCA
CAAGCTTTAA CCTTCTTGAT TCGTGACCAA CGCTTAGGTG CTAACGTTGG TTCTGCTCAA
GGGCCTACTG GTCTAGGTAA ATACTTGATG CGTTCTCCTA CTGGTGAAAT CATCTTCGGT
GGTGAAACCA TGCGCTTCTG GGATTTCCGT GGTCCTTGGT TAGAGCCTCT CCGTGGTCCT
AACGGTCTTG ACCTCGAGAA AATCAAGAAT GATATTCAGC CTTGGCAAGC TCGTCGTGCT
GCTGAATACA TGACTCACGC TCCTCTAGGT TCTTTGAACT CTGTAGGTGG TGTAGCTACT
GAAATCAACT CTTTCAACTA TGTATCTCCT CGTGCGTGGT TGTCTACCTC TCACTTCGTA
TTAGGTTTCT TCTTCCTTAT CGGTCACTTG TGGCATGCAG GACGCGCTCG TGCGGCTGCT
GGTGGTTTTG AGAGAGGTAT TGACCGTGAG AACGAACCAG CATACAGCAT GAAGGATCTT
GACTAG
 
Protein sequence
MVTLSNRSVI GGGRDQESSG FAWWSGNARL INLSGKLLGA HVAHAGLIVF WAGAMTLFEV 
AHFIPEKPMY EQGLILLPHI ATLGWGVGAG GEVIDTFPYF VAGVLHLISS AVLGFGGIYH
AVRGPETLEE YSSFFGYDWK DKNKMTNIIG FHLIILGCGA LLLVLKAMFF GGVYDTWAPG
GGDVRVITNP TLNPAIIFGY LIKAPFGGEG WIVSVDNMED VIGGHIWIAL ICISGGIWHI
FTKPFAWARR AFIWSGEAYL SYSLGALSLM GFIASIMVWY NNTVYPSEFF GPTGPEASQA
QALTFLIRDQ RLGANVGSAQ GPTGLGKYLM RSPTGEIIFG GETMRFWDFR GPWLEPLRGP
NGLDLEKIKN DIQPWQARRA AEYMTHAPLG SLNSVGGVAT EINSFNYVSP RAWLSTSHFV
LGFFFLIGHL WHAGRARAAA GGFERGIDRE NEPAYSMKDL D
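
The relation between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the set of initiator codons is the one commonly listed for translation table 11, included here as an assumption. The amino-acid assignments for table 11 are the same as in the standard code; an initiator codon such as GTG in the first position is read as methionine.

```python
from itertools import product

# Codon table built in the conventional TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiator codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3          # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"                  # initiator codon reads as Met
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene sequence above.
first_line = "GTGGTAACGC TCTCTAATAG ATCAGTTATA GGAGGCGGAC GTGATCAAGA ATCAAGCGGC"
print(translate(first_line))
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared against the GC content field of the record.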