Gene Aazo_1385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1385 
Symbol 
ID9339180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1456147 
End bp1457409 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content35% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003720754 
Protein GI298490577 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0475901 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTCAG CAATAGTCAG CAATAGATTA ATTCTATGTG TATTTGCCTT TAGTGTCAGT 
TTTGGTTTAA GCTTAGTCCC CAACTGGGAT TTTACTAAAG CCTTACTCAC AGCTATAATT
ACCGTCTTAG CAACCTATTC AGCCGCATTA TTTGTTGATA AACGGCGTAG AAATTATGAA
ATGCTAGTTT TAAGTTCTCT GCATAGAAGA ATTAGAGAAC TGGAAGGATT AAAATATCGG
ATTGCTAGAG AAATTAAACA AGTAGAAGAA CATAAAACCA TCCTCTATAC AGAATCACAA
CAACTACAAA ATCAGATCGC AGAATGTCGT AATCAAAGAG ATAGCCTTCA TCGAGAGCTA
GGGACCTTCG CCGGACAAAA AAAGCAGTTA GAATTAGAAA TTAATAATTT CACAACCGAA
CTGCATAAAC TACAAAATAA TACAGAGAAA CTTAATAATT CTTGCTCTGA ACTCGCAGCA
GAAAAACGCC GTCTAGAACT AAATTGTAAC GTATTTCGTT CAGAAATTAC CCAACTAAAA
GCACAAATTG AAGCCTTCAA GCAAGAAAAA CAGGAGATAG AAAATAACCT AGTTCTCATC
AATCGACTCA AACCCCAACT AGACGAAAAA CTTTATGAAC TGAGAATCGA AATTCACGAA
TTAGAAATCA AAGTATCCAA TCAGAATAAC TTACTCACAC ATACAATAAC CGCCAGAGAA
AATCTCGAAA ATATTCTCAC CGACCTATAC ACCAAAAAAC AAGCACAACA ATCAGAAATT
AGCCAATTAC ACAATCAAAT CTCTTTATTA CAAGATGAGC GCCACTTGTT GCAAAACCAA
GTTTGGGAAT TACTCCAAAA CATGGAAACT CTCGATCAAG ACGCATTAAC TGAAAACACA
CAAGAAGATC TTGAATTATT TCCTTTTGAT GAAATACTAG AACCCGTAGA TCCTTCAAGT
CCTACTTCAG ATAATTTACC ACCAGAATGG AGTAATTTTC TAGAAAAATT GCCAACAAAC
CAAATTCACG TATTAAAAGC CATAGTTGCT CAAGATAATC CCAAGGCTAT TATCAAGCAA
ATCGCCGAAG AACATATGAC CATGCCAAAT TTATTAATTG ATGCTATAAA TGAAGTTGCC
AATCTTACCC TTGGGGAACT AGTTATTAAA ACAGATGCAG AAATCCCCGA AATTTATCAA
GATCATATAC TAAATTTGAG AAAAATGCTC ACCAAGCATG AAGAATTAAT TACTCAATCC
TAA
 
Protein sequence
MQSAIVSNRL ILCVFAFSVS FGLSLVPNWD FTKALLTAII TVLATYSAAL FVDKRRRNYE 
MLVLSSLHRR IRELEGLKYR IAREIKQVEE HKTILYTESQ QLQNQIAECR NQRDSLHREL
GTFAGQKKQL ELEINNFTTE LHKLQNNTEK LNNSCSELAA EKRRLELNCN VFRSEITQLK
AQIEAFKQEK QEIENNLVLI NRLKPQLDEK LYELRIEIHE LEIKVSNQNN LLTHTITARE
NLENILTDLY TKKQAQQSEI SQLHNQISLL QDERHLLQNQ VWELLQNMET LDQDALTENT
QEDLELFPFD EILEPVDPSS PTSDNLPPEW SNFLEKLPTN QIHVLKAIVA QDNPKAIIKQ
IAEEHMTMPN LLIDAINEVA NLTLGELVIK TDAEIPEIYQ DHILNLRKML TKHEELITQS