Gene Aazo_4038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4038 
Symbol 
ID9341843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4097266 
End bp4098486 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content44% 
IMG OID 
Productgeranylgeranyl reductase 
Protein accessionYP_003722626 
Protein GI298492449 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.670598 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACACTAC GGGTTGCTGT TGTTGGTTCA GGCCCAGCTG GTTCATCTGC TGCTGAGACA 
TTAGCAAAAG CTGGGATTGA AACTTATTTA ATTGAGCGCA AGCTGGATAA CGCTAAGCCT
TGCGGGGGTG CTATTCCCCT ATGTATGGTG AGTGAGTTTG ACCTACCTCC AGAGATTATC
GACCGTCGAG TGCGGAAGAT GAAAATGATT TCTCCTTCTA ATCGTGAGGT AGATATCAAT
CTGGTAAATG AAGAAGAATA TATAGGAATG TGCCGCCGTG AAGTATTGGA TGGATTCCTA
CGCGAACGGG CGGCAAAACT AGGTGCTAAT TTAATTAACG CCACTGTTCA TAAACTTGAT
ATACCCACAA ACAACACTGA CCCCTATACA ATCCATTACG TTGACCATAC AGAAGGTGGG
GCACAAGGGA TTACGAAAAC ACTGAAGGTA GATTTAGTGA TTGGTGCTGA TGGGGCAAAT
TCCCGCATTG CTAAAGAAAT GGATGCTGGG GATTACAATT ATGCGATCGC ATTCCAAGAA
CGCATTCGTC TACCCCAAGA CAAAATGGCC TACTACAACG ACATGGCCGA AATGTATGTG
GGTAATGACG TTTCTACCGA CTTCTATGCT TGGGTATTTC CCAAATATGA TCACGTAGCT
GTTGGTACAG GAACAATGCA GGTTAATAAA GCCAACATCA AACAGTTACA AGCGGGTATT
CGCGCCCGTG CTTCTAAAAA ATTAGCTGGT GGTCAAATTA TCAAAGTCGA AGCCCACCCC
ATCCCTGAAC ATCCCCGTCC TCGTCGTGTA GTTGGACGTA TTGCGTTGGT AGGTGATGCT
GCTGGTTATG TCACCAAGTC CTCTGGTGAA GGTATCTATT TCGCGGCTAA ATCTGGACGG
ATGTGTGCAG AAACCATTGT GGAAGTTTCT AACAATGGTG TGCGTATTCC TACAGAAAAC
GACTTGAAGA TTTACCTGAA GCGTTGGGAT AAGAAATACG GACTCACTTA CAAGGTATTG
GATATTCTTC AAACCGTGTT CTATCGTTCC GATGCTACCC GTGAAGCATT TGTAGAAATG
TGTGATGACA TGGATGTACA ACGGCTAACA TTTGATAGCT ATTTATACAA AACAGTAGTT
CCAGCTAACC CCATCACTCA ACTCAAAATT ACTGCCAAAA CCATCGCTAG TTTATTACGC
GGTAATGCCC TTGCACCTTA A
 
Protein sequence
MTLRVAVVGS GPAGSSAAET LAKAGIETYL IERKLDNAKP CGGAIPLCMV SEFDLPPEII 
DRRVRKMKMI SPSNREVDIN LVNEEEYIGM CRREVLDGFL RERAAKLGAN LINATVHKLD
IPTNNTDPYT IHYVDHTEGG AQGITKTLKV DLVIGADGAN SRIAKEMDAG DYNYAIAFQE
RIRLPQDKMA YYNDMAEMYV GNDVSTDFYA WVFPKYDHVA VGTGTMQVNK ANIKQLQAGI
RARASKKLAG GQIIKVEAHP IPEHPRPRRV VGRIALVGDA AGYVTKSSGE GIYFAAKSGR
MCAETIVEVS NNGVRIPTEN DLKIYLKRWD KKYGLTYKVL DILQTVFYRS DATREAFVEM
CDDMDVQRLT FDSYLYKTVV PANPITQLKI TAKTIASLLR GNALAP