Gene Aazo_3074 details

Gene Information

Locus tag: Aazo_3074
Symbol:
ID: 9340878
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 3165182
End bp: 3167044
Gene Length: 1863 bp
Protein Length: 620 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003721956
Protein GI: 298491779
COG category:
COG ID:
TIGRFAM ID:
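
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3165182, 3167044)
print(length_bp)                  # 1863
print(protein_length(length_bp))  # 620
```

Both values agree with the Gene Length and Protein Length fields of the record.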


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAATTT GTCAAAATCC CAATTGTTCA AACCCATTCA ATCCTGACTA CAGTAAATTT 
TGCATCGTTT GTGGACACGG TACATTTGGA CAAATCCTCA GAAACCGTTA CCGTGTTTTG
CGTTTATTAG GAGAAGGTGG GTTTAGCAAA ACTTACGCCG CAGAAGATGT AGACAGACTC
AACGCACCTT GCGTTATTAA GCAATTTTTC CCCCAAGTTC AGGGAACAGG ACAACGTGCA
AAAGCAGCAG AATTTTTCAA GGAAGAGGCT TTCAGATTAT ATGAACTTGG AGAAAATCAT
CCCCAAATTC CTCGTTTATT AGCTTACTTT GAACAAGGTG CTAGTTTATA TTTAGTCCAG
GAATTTATTA TTGGCAAAAC CCTCCTAGAA GAAGTTCAAG AACAGGCCTA TAGCGAAGCA
GAAATTCGTA AACTTTTGTT AGATTTATTA CCAGTTCTTG ATTTTATTCA CCAAAGAAAC
GTAATTCATC GGGATATTAA ACCCGAAAAT ATTATTCGTA GAGATACAGA TCAAAAACCT
GTATTAATTG ACTTTGGTGG TGCAAAACAA GTAACTCACA CCAGCATAGC TAGACAAGCC
ACAGCTATTT ATACTTTAGG TTATGCACCA ACCGAACAAA TGGCAGGTTT TGCTTGTCAT
GCAAGTGACT TATATGCTTT GGGTGTAACC TGTGTCAGGT TATTAACTCA AGATTTACCC
ATGCAAGATA CCTATGGACT TAAAGATCCT CTTTATGATC CTATGACTGC GAAATGGTTA
TGGAAAGAAC GTTTACAGGT AAAAAGTATT ACCATCAGTC AGGAATTAAT ATACATTTTA
GATCAATTAC TGCAACATTT CCCCAACGAT AGATATCAGT CAGCAGCAGA GGTTTTAGAT
GATTTAACAG CGGAACTATC ACTGCCATTA GGATTAGAAC CAGTAAGTTC AGCAATTTTC
CATATTATCA AAACACCATT AACACCTCCA GGTCCAGCAC GAAAAGTTAT CGTTCCCTTA
CTACCTTTAA AAGCTTTTGA TTTTGATGTA GTTACAGTAG ACACAGCAGG AAAAGAAACT
AGTTATGACA CACTTAGCGC CAAATTGTTT CTAGAACAAT TAAACAAAAA CGTCGCTTTA
GAAATGGTAT CAATTCCTGG TAGTAGTTTT CTGATGGGTT CACCAGAATT TGAAGGTGAT
GCTGAGGAAT ATCCGCAACA TCAAGTCACA GTTAAACCTT TTTTCATGGC GAAATATCCC
ATTACTCAAG CACAGTGGAA AGCAGTTGTA GCATTACCGC AAGTCACCCA AGCTTTAAAC
TCCAAGCCAT CAAAATTTAA AGGTGCAAAT TTACCCATAG AGAATATTTC TTGGTATGAA
GCGGTGGAAT TTTGTTTGCG GTTATCAATG AAAACTGGAC GAAATTATCG TTTACCTAGT
GAAGCTGAAT GGGAATATGC TTGTCGTGCG GGAACTACTA CTGCTTTTCA TTTTGGGGAA
AGAATTACTT CTGATTTAAT TAATTGTAGC GGTGGTGATT TTTATATTGT CCCAACCAAA
AGCGACTTCC GTAAACAAAT CACAAATGTC GGCAGTTTTG ATATAGCGAA TGCTTTTGGT
TTATATGATA TGCATGGGTT AGTTTGGGAA TGGTGTGCTG ATCCCTGGCA TAACAATTAT
GAAGGTGCAC CGACCAATGG TAGTATTTGG GATGTTGATG GTGATATACA TCGTCGGGTT
TTGCGCGGTG GTGCTTGGAA TTTCAATGCA GAACTTTGTC GCAGTGCTAG TCGCAGTTGG
AATGAAGCAG AAGGTGGTTT AAGAATGTCG GGTTTGCGAG TGGTGTTTTC AGTTGAGGAA
TGA
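
The GC content field reports the percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as shown above (blocks of 10 bases separated by spaces and line breaks). The helper names `clean_sequence` and `gc_content` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input to show the call; paste the full gene sequence to check the record.
print(gc_content("ATGC ATGC"))  # 50.0
```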
 
Protein sequence
MQICQNPNCS NPFNPDYSKF CIVCGHGTFG QILRNRYRVL RLLGEGGFSK TYAAEDVDRL 
NAPCVIKQFF PQVQGTGQRA KAAEFFKEEA FRLYELGENH PQIPRLLAYF EQGASLYLVQ
EFIIGKTLLE EVQEQAYSEA EIRKLLLDLL PVLDFIHQRN VIHRDIKPEN IIRRDTDQKP
VLIDFGGAKQ VTHTSIARQA TAIYTLGYAP TEQMAGFACH ASDLYALGVT CVRLLTQDLP
MQDTYGLKDP LYDPMTAKWL WKERLQVKSI TISQELIYIL DQLLQHFPND RYQSAAEVLD
DLTAELSLPL GLEPVSSAIF HIIKTPLTPP GPARKVIVPL LPLKAFDFDV VTVDTAGKET
SYDTLSAKLF LEQLNKNVAL EMVSIPGSSF LMGSPEFEGD AEEYPQHQVT VKPFFMAKYP
ITQAQWKAVV ALPQVTQALN SKPSKFKGAN LPIENISWYE AVEFCLRLSM KTGRNYRLPS
EAEWEYACRA GTTTAFHFGE RITSDLINCS GGDFYIVPTK SDFRKQITNV GSFDIANAFG
LYDMHGLVWE WCADPWHNNY EGAPTNGSIW DVDGDIHRRV LRGGAWNFNA ELCRSASRSW
NEAEGGLRMS GLRVVFSVEE
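
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses only the standard library and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGCAAATTT GTCAAAATCC CAATTGTTCA AACCCATTCA ATCCTGACTA CAGTAAATTT"
print(translate(first_line))  # MQICQNPNCSNPFNPDYSKF
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence to the same function should yield the full protein up to the terminal TGA stop codon.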