Gene Aazo_2421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2421 
Symbol 
ID9340220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2518714 
End bp2519934 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content45% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003721473 
Protein GI298491296 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.762452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATACAAG AACCCAGCAC CCATCTAATT ATTTCCGAAG CAGTAGAGGA CTTAATCCCT 
AAGGAAACCC TGTCGATAGA GACTTACGCT GAAGGCTTAA TGGATGATCT CTTCACCGAG
ATCGACAACA TTCTGGAGGG TCGTAGAAAA CGTCCTAACA AAGCTGTGAG GGCAGAGTAC
ATACCCATGC AAACAGTGAC GATGAAAATA CCAGAAGTGG TTTTGCCACA AACAGTACAT
CGCGCAGTTC AAGCAATGTC GTCAAGCAAA AACGAGCAAA CGAGTACACT GGTATTCAAT
TCTGCTTCTG TAACTGCAAT TACTAAAAGG ACTGAACAAC ACCGTGGTAG TTTAACTAAC
CTGCTAATTC TGGGAGCAAC CTTAAGTGCA GCAATGGTGG GTACACTTTA TTTAACAGAG
TCGGGATTAC TCACAAATAT TACCGCTAAA ATAATTCCGC AGACGCTACA AACAAGCCAG
CTACAATCGC CCGTCCTCCT ACCACCAGAT CCTCTAGGGG AATTGGTGAA TTATATGCTG
GAAGCCTTGG CTGTCATTGA CCAGCAAAGC AACACTCCCA CACAGGCTAA ATCTGGTTTC
CCTGATGTCA ACCTCAGCCA GTCCAACTCC TTAGCCTTGG CGAACACCCA ACCTGTTGGT
ACTTTACCTC CACCAGTTGC AGCGGACAAT GTATCAATTG TTCCCAGTCG GGTAAGAAAT
GTTATCGAAC GGGTATATGT CCCTGTTTAC CAAGCACCTC CACTTATAAA CCCATTACCA
CAAGTACCAT CGTTGCCAGG GCAGGTTTCT CTACCTCAGT CTGTACAAGA TACACCCCAA
AATGTGCAAG CAGAGGCAAA GCCAATACCG GAAAAAATGT CACCAGCAAC TGTAAAACAA
GCAGTTAATC CCCTCCCAAC CCGCATAGCA CCACCAAAAC TACCTACTGC GACAATAACT
ATACCAGCAG CCAAGCCTGA AGCAGCATCA ACTACAGTCC AGCAGGTTTA TTTACCTGCC
TATTCCGCAG AGTTAGAGGG ATTGTTAGAG TTAGGTAAAA AGTCTGCGGC TTTATTTAAA
GTTGATGGTG TGACTCGTCG TATTAATTTA GGACAGAGTA TTGGTCCCAG TGGCTGGACA
TTGGTAGAGG TGAGTAATGG TGAAGCAATT ATCCGCCGTA ATGGTGAGGT ACGCTCAATT
TATACTGGTC AAAAACTGTA A
 
Protein sequence
MIQEPSTHLI ISEAVEDLIP KETLSIETYA EGLMDDLFTE IDNILEGRRK RPNKAVRAEY 
IPMQTVTMKI PEVVLPQTVH RAVQAMSSSK NEQTSTLVFN SASVTAITKR TEQHRGSLTN
LLILGATLSA AMVGTLYLTE SGLLTNITAK IIPQTLQTSQ LQSPVLLPPD PLGELVNYML
EALAVIDQQS NTPTQAKSGF PDVNLSQSNS LALANTQPVG TLPPPVAADN VSIVPSRVRN
VIERVYVPVY QAPPLINPLP QVPSLPGQVS LPQSVQDTPQ NVQAEAKPIP EKMSPATVKQ
AVNPLPTRIA PPKLPTATIT IPAAKPEAAS TTVQQVYLPA YSAELEGLLE LGKKSAALFK
VDGVTRRINL GQSIGPSGWT LVEVSNGEAI IRRNGEVRSI YTGQKL