Gene Aazo_5168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5168 
Symbol 
ID9342975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5292928 
End bp5294511 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content42% 
IMG OID 
Productradical SAM domain-containing protein 
Protein accessionYP_003723343 
Protein GI298493166 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGTTT TGCTAGTTTA TCCAATATTT CCCAAAACCT TTTGGTCCTA TGAAAAAGTC 
CTAGCACTGG TAGACAGAAA GGTTTTGTTA CCACCATTGG GTTTAGTCAC AGTAGCGGCG
ATTTTGCCCC AAGAATGGGA ATTTAAACTT GTAGATCGTA ACATCCGCGC TGCCACCGAA
GAAGAATGGG CATGGGCAGA TATAGTCATA TTCTCGGCTA TGATTGTCCA AAAACAAGAC
TTACTAGATC AAATTCGGGA AGCAAAAAGA CGTGGTAAGT TAGTGGCTTT GGGTGGACCC
TATCCCACAT CTACAGCCGA TGAAGTAGAA GCAGCAGGGG CAGATTTCCT AATCTTGGAT
GAAGGAGAAA TCACTTTACC CATGTTTGTG GAAGCTGTAC AAAAAGGTGA AAAATCTGGA
GTTTTCCGCG CTACAGAAAA ACCTGATGTC ACAGGTACAC CCATTCCCCG CTTTGATTTA
TTAGAATCTG ATGCCTATGA TATGATGTCG GTGCAGTTTT CGCGTGGTTG TCCCTTCCAG
TGCGAATTTT GCGACATCAT TGTATTATAT GGACGCAAAC CCAGAACCAA AACACCCGCA
CAACTGTTAG CAGAATTAGA TTATCTGTAT GAGTTGGGTT GGAGACGTGG TGTATTCATG
GTAGATGATA ACTTTATTGG CAACAAACGC AATGTGAAAT TGTTGCTGAA AGAGTTAAAA
GTTTGGATGG CTGAACATCA ATATCCCTTC AATTTTGACA CAGAAGCTTC CATTGACTTG
GCACAAGATG CAGAGATGAT GGAGTTGATG GTTGATTGCG GATTTAAAGC AGTATTTTTG
GGTATTGAAA CACCAGATGA AGATAGTTTA CAACTAACTA AGAAATTCCA AAATACTCGC
AGTTCTTTAA CTGAGTCTGT AGAAACTATC ATTAAAGCTG GACTGCGGCC AATGGCTGGG
TTTATTATTG GTTTTGATGG TGAAAAAGCC GGTGCAGGCG ATCGCATCGT CAGATTTGCA
GAACAAGCAG CCATCCCTTC TACCACCTTT GCTATGTTAC AAGCATTACC CAACACTGCA
TTGTGGCATC GCCTAAAAAA AGAAGGCAGA CTGCGGGAAA ATAAAGACGG AAACATCAAT
CAAACCACAT TGATGAATTT TATTCCCACC CGTCCCCTAG AAGAACTTGC TAGGGAATAT
GTGGAAGCCT TCTGTGCTTT ATATGACCCA GTGGCATATT TAGATCGCAC CTATCGCTGT
TTCTTAATGT TGGGTTCTCC CGAATGGACA GCACCAGCTA AAACGCCAGA ATGGGTAGTT
ATCAAAGCAC TGCTAATTGT AATTTGGAGA CAAGGTTTTA AACGGGAAAC CCGCTGGAAA
TTCTGGCATC ACTTCTTGAG CATTCTCAAG CATAACCCCA AAGTAATTGA ACAGTACGTT
TCTACTTGCG CCCACATCGA ACATTTTATG GAATATCGGC AAATTGTGCG CGATGAAATT
GAAAGTCAAT TAGCCGCTTA TTTAGCCCAA GGTGCAGAAA AACCTTATAT TCCAGAAAAG
GAAAAAGTAG AAGCGGTAGT TTAG
 
Protein sequence
MRVLLVYPIF PKTFWSYEKV LALVDRKVLL PPLGLVTVAA ILPQEWEFKL VDRNIRAATE 
EEWAWADIVI FSAMIVQKQD LLDQIREAKR RGKLVALGGP YPTSTADEVE AAGADFLILD
EGEITLPMFV EAVQKGEKSG VFRATEKPDV TGTPIPRFDL LESDAYDMMS VQFSRGCPFQ
CEFCDIIVLY GRKPRTKTPA QLLAELDYLY ELGWRRGVFM VDDNFIGNKR NVKLLLKELK
VWMAEHQYPF NFDTEASIDL AQDAEMMELM VDCGFKAVFL GIETPDEDSL QLTKKFQNTR
SSLTESVETI IKAGLRPMAG FIIGFDGEKA GAGDRIVRFA EQAAIPSTTF AMLQALPNTA
LWHRLKKEGR LRENKDGNIN QTTLMNFIPT RPLEELAREY VEAFCALYDP VAYLDRTYRC
FLMLGSPEWT APAKTPEWVV IKALLIVIWR QGFKRETRWK FWHHFLSILK HNPKVIEQYV
STCAHIEHFM EYRQIVRDEI ESQLAAYLAQ GAEKPYIPEK EKVEAVV