Gene Aazo_3741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3741 
Symbol 
ID9341546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3798494 
End bp3800959 
Gene Length2466 bp 
Protein Length821 aa 
Translation table11 
GC content43% 
IMG OID 
Productsurface antigen (D15) 
Protein accessionYP_003722407 
Protein GI298492230 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATTTAT CTCCGATGTT ACTGGCAGTT GTGGCGATCA CAACCCCTTT TGATAGTTCA 
TTGAGTGCAA CTGCACAAAC TCCTGATTAT ACTCAGCAAG TAACAGAAGT TGCCACGGTA
GGAACAAACC AACATCTGCC ACAGAAAAGC AGTCAGAATC ATCTTGATCA ACAACAAGAA
TCAGTCATAG TCACGAAAAC AGAAGTTAGA GAACCCGAGT CTCCATCCTC TATAACTCCC
TCCATAACTC CTATTTCAGC CTTAACCATT GACTCAACAA CAATAGCAAC ACCAGAAGTA
ATTCCTCCAA ATATCAGCAC CTCATCAAAA ACAGCACAAG CTTTAAAAAA AGCAATAGCT
CCTAACTCAG CAAACAGAGA AAGGAAAGCG GTAATTATCC CCACAGCACA AACCCTAAAA
GCATCGCTAT TATCTACCAA TTCAGTAGGG ATAGAGACAT CACAAGCAAT TATACCCAAT
ATCACCACCC CATCCAAAAC AGCACAAGAG CAAGAATCAG AAGCAACTGC TGACACAAAT
TATATACAGG AAAACGTCGA ACCACCCACA GCACAACCAG CTTTCCCGAC TAATCCCGAA
ACAACACCAA CAGCAGAACC TCTTGTATTA GTTTCAGAAG TAGCAGTCAA ATCTCTAACT
GGTGCTATTG CAAAAGAACT AGAAGATAAA GTTTATCAAG CCATTCGTAC CCAAGCAGGA
CAAACAACTA CTCGCTCCCA ACTCCAAGAA GATATTAATG CTATCTTTGC CACGGGCTTT
TTCTCTAACG TGCAGGCCAT GCCAGAAGAT ACGCCCTTGG GTGTAAGGGT AAGTTTCGTT
GTTAGTCCTA ACCCGGTTCT GAGTAAAGTA CAAGTAGAAG CTAATCCGGG AACTGGTGTA
GCTTCAGTCA TACCTGCCAA AACTGTAGAT GAAATCTTCA GTAAACAGTA TGGACAAATC
TTGAATTTGC GAGATTTACA AGAAGGAATT AAAGAACTAA CCAAAAAGTA TCAAGACCAA
GGTTATGTGC TGGCCAACGT GATTGGAGCA CCTAAAGTAT CAGAGAATGG AGTTGTTACC
CTACAAGTAG CAGAAGGGGT AGTAGAAAAT ATCAAAGTCC GTTTCCGCAA CAAACAGCGT
GAGGAAGTAG ACGACAAAGG TAATCCCATT CGCGGACGAA CAAAAGATTA TGTAATTAAG
CGAGAATCCG AATTAAAGCC TGGTCAGGTA TTTAACCGCA ACATCGTGCA GAAAGACCTG
CAAAGGATAT TTGGTTTAGG ATTATTTGAA GATGTAAGTG TGTCCCTTGA TCCTGGTACA
GATCCAAGTA AGGTAGATGT GGTACTCAAT GTGGCGGAAC GCAGTAGTGG TTCAATCGCT
GCTGGTGCGG GTATTAGTTC TGCCACTGGG TTATTTGGTA CTGTTAGCTA TCAACAGCAA
AACCTGGGGG GTAGAGCGCA GAAACTGGGG GCTGAGGTAC AGTTAGGAGA AAGAGAATTG
CTGTTTGACC TGCGGTTTAC AGATCCTTGG ATTGCTGGTG ATCCTTACCG TACTTCCTAC
ACAGCTAATA TTTTCCGTCG TCGTTCAATT TCTCTGATTT TCGAAGGTAA AAATGACGCT
ATAGAAACCT TTGACCCTAG TGATATTACT AATGAAGATG ACCAGGATCG CCCCCGAATT
ACCCGTTTAG GCGGTGGTGT ATCCTTTACC CGTCCTCTTG CTGCTAATCC TTACCAAAAT
TCAGAATGGA CAGCTTCAGC AGGTTTGCAG TATCAACGAG TTTCTAGCCG TGATGCTGAT
GGCAATCTGA GAAAAGAAGG GGCGATATTT GATGATAATG GCAATCAAAT TAGCCCTACA
ATTCCTCTGA CGCAATCAGG TACAGGTGAA GACGATTTGC TATTATTGCA ACTGGCCGCA
CAACGTGATC GCCGTAATAA TCCCTTACAA CCTACCAATG GTTCTTACCT CCGCGTCGGA
GTTGACCAAT CTGTACCCGT GGGACAAGGC AATATTTTAC TGACTAGGCT ACGGGGTAAC
TACAGCCAAT ATTTACCAGT AAAATTCATC GGCTTTGGTA AAGGCGCACA AACCCTAGCA
TTTAACCTCC AAGGGGGTAC AATTCTTGGT GATGTACCTC CCTACGAAGC CTTTACCCTT
GGTGGTAGTA ATTCTGTGCG GGGTTACGAT GAAGGGAGAT TAGCAACTGG ACGTAGCTAT
ATACAAGCAT CTGTTGAGTA TCGTTTTCCT GTCTTTTCTG TAGTCAGTGG CGCTCTATTT
TTTGATTACG GTAGTGACCT GGGAAGCAAT ACCAGAACAG CAGAAATTTT GAACAAAAAT
GGTACTGGCT ATGGTTATGG TCTAGGTGTG CGTGTACAGT CACCATTAGG ACCAATTCGT
ATAGACTACG GTATGAGCGA TGATGGCGAT AGCCGCATTA ACTTCGGGAT AGGGGAAAGG
TTTTAA
 
Protein sequence
MHLSPMLLAV VAITTPFDSS LSATAQTPDY TQQVTEVATV GTNQHLPQKS SQNHLDQQQE 
SVIVTKTEVR EPESPSSITP SITPISALTI DSTTIATPEV IPPNISTSSK TAQALKKAIA
PNSANRERKA VIIPTAQTLK ASLLSTNSVG IETSQAIIPN ITTPSKTAQE QESEATADTN
YIQENVEPPT AQPAFPTNPE TTPTAEPLVL VSEVAVKSLT GAIAKELEDK VYQAIRTQAG
QTTTRSQLQE DINAIFATGF FSNVQAMPED TPLGVRVSFV VSPNPVLSKV QVEANPGTGV
ASVIPAKTVD EIFSKQYGQI LNLRDLQEGI KELTKKYQDQ GYVLANVIGA PKVSENGVVT
LQVAEGVVEN IKVRFRNKQR EEVDDKGNPI RGRTKDYVIK RESELKPGQV FNRNIVQKDL
QRIFGLGLFE DVSVSLDPGT DPSKVDVVLN VAERSSGSIA AGAGISSATG LFGTVSYQQQ
NLGGRAQKLG AEVQLGEREL LFDLRFTDPW IAGDPYRTSY TANIFRRRSI SLIFEGKNDA
IETFDPSDIT NEDDQDRPRI TRLGGGVSFT RPLAANPYQN SEWTASAGLQ YQRVSSRDAD
GNLRKEGAIF DDNGNQISPT IPLTQSGTGE DDLLLLQLAA QRDRRNNPLQ PTNGSYLRVG
VDQSVPVGQG NILLTRLRGN YSQYLPVKFI GFGKGAQTLA FNLQGGTILG DVPPYEAFTL
GGSNSVRGYD EGRLATGRSY IQASVEYRFP VFSVVSGALF FDYGSDLGSN TRTAEILNKN
GTGYGYGLGV RVQSPLGPIR IDYGMSDDGD SRINFGIGER F