Gene Aazo_3964 details

Gene Information

Locus tag: Aazo_3964
Symbol:
ID: 9341768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4026052
End bp: 4027737
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 41%
IMG OID:
Product: ABC-1 domain-containing protein
Protein accession: YP_003722579
Protein GI: 298492402
COG category:
COG ID:
TIGRFAM ID:
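
The coordinate and length fields can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and assumes the gene length includes the stop codon while the protein length does not; the record does not state these conventions, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4026052
END_BP = 4027737
GENE_LENGTH_BP = 1686
PROTEIN_LENGTH_AA = 561


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons agree with the listed values.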


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.871344
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAAG GTTATTCAGA TAAAGCATAC CGTTGGAATC GGGAACATTA TTCTAGCAAA 
CGTCGCTTTG TAGATATTTG GTCTTTTGTT TTGACTTTAA TGTTTAAGCT TTGGCGCTAT
AACAAAGCTT GGACTTATCC AGGTGGTATG ACAGAGCCTA AACAGGCTAC AAGACGTAAA
GTTCAAGCTA TTTGGATTAG AAATACTTTT TTGGATTTGG GTCCGACTTT TATTAAGGTG
GGACAATTGT TTTCTACTCG TGCGGATATC TTCCCTAGTG AATATGTGGA AGAACTATCT
AAGTTACAAG ATAGAGTTCC AGCATTTAGC TATGAGCAGG TAGAAACGAT TATTGAAGAA
GAATTAGGGA AAAAAGTTCC CCAATTGTTC CATAGTTTTG AACCCATTCC TTTGGCTGCT
GCTAGTTTGG GACAAGTTCA TAAAGCTGTG CTGCATACTG GGGAGTCTGT TGTTGTGAAG
GTACAACGTC CAGGATTAAA AAAACTGTTT GAAATTGATT TAAAAATTCT CAAAGGTATT
GCTAGTTATT TCCAAAATCA TCCTAAATGG GGACATGGAC GGGATTGGAT GGGTATATAT
GAAGAATGTT GTCGCATTCT TTGGGAAGAG ATTGATTATC TGAATGAAGG CCGCAATGCT
GATACTTTTC GTCGCAACTT TCGTGCTTAT AATTGGGTGA AAGTACCACG AGTTTATTGG
CGTTATGGTA CTTCTAGGGT AATTACCTTG GAATATATGC CCGGTATTAA AGTTAGCCAA
TATGAGGCTT TAGAAGCGGC AGGTGTGGAT AGAAAGGCGA TCGCTCGTTA TGGCGCACAA
GCATATTTAC ACCAATTACT CAATAATGGT TTCTTCCATG CTGATCCTCA CCCTGGTAAT
CTCGCGGTTA GTCCCGACGG AGCGTTGATT TTCTACGATT TCGGGATGAT GGGGCGAATT
AAATCCAATG TCCGCGAAGG ACTCATGGAT ACGCTGTTTG GTATCGCTCA AAAAGATGGC
GATCGCGTGG TACAGTCTCT AATTGATTTG GGTGCGATTG CACCAGTCGA TGACATGGGA
CCTGTACGTC GGTCTGTCCA GTATATGCTG GATAACTTCA TGGATAGGCC CTTTGAAAAC
CAATCAGTGT CCGCCATCAG TGAAGACCTG TACGAAATAG CTTACAATCA ACCTTTTAGA
TTTCCAGCAA CTTTCACCTT TGTCATGCGT GCTTTTTCTA CTTTAGAAGG AGTAGGTAAA
GGTCTAGATC CAGAATTTAA TTTTATGGAA GTTGCCCAAC CTTATGCAAT GCAGCTTATG
AGTGGTAAAA ATGGTTTAGA GGGGAATAGT TTCTTGAATG AGTTAAGTCG TCAAGCAGTA
CAAGTCAGTA GCAGTGCTTT AGGATTACCA CGCAGACTAG AAGACACACT CGATAAAATC
GAACGTGGGG ATATTCGTTT CCGAGTTCGT TCCGTGGAAA CAGAACGCCT AATACGCAGA
CAGAGTAACA TTCAACTCGG AATGAGCTAT GCTCTTATAA TTAGTGGTTT CACAATTGCA
GCCACAATTC TCCTAATTGG CGAGTATTTG TGGTTAGCAG TTTTCACTGC TTTAATTGCA
GCAGCAGTAT CATTCCTGTG GATTCGTCTG CTTTTACGCC TTGACCGTTA TGATCAAAAG
TATTAA
 
Protein sequence
MEKGYSDKAY RWNREHYSSK RRFVDIWSFV LTLMFKLWRY NKAWTYPGGM TEPKQATRRK 
VQAIWIRNTF LDLGPTFIKV GQLFSTRADI FPSEYVEELS KLQDRVPAFS YEQVETIIEE
ELGKKVPQLF HSFEPIPLAA ASLGQVHKAV LHTGESVVVK VQRPGLKKLF EIDLKILKGI
ASYFQNHPKW GHGRDWMGIY EECCRILWEE IDYLNEGRNA DTFRRNFRAY NWVKVPRVYW
RYGTSRVITL EYMPGIKVSQ YEALEAAGVD RKAIARYGAQ AYLHQLLNNG FFHADPHPGN
LAVSPDGALI FYDFGMMGRI KSNVREGLMD TLFGIAQKDG DRVVQSLIDL GAIAPVDDMG
PVRRSVQYML DNFMDRPFEN QSVSAISEDL YEIAYNQPFR FPATFTFVMR AFSTLEGVGK
GLDPEFNFME VAQPYAMQLM SGKNGLEGNS FLNELSRQAV QVSSSALGLP RRLEDTLDKI
ERGDIRFRVR SVETERLIRR QSNIQLGMSY ALIISGFTIA ATILLIGEYL WLAVFTALIA
AAVSFLWIRL LLRLDRYDQK Y
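
The sequences above are printed in space-separated blocks of 10 characters. A minimal Python sketch for working with that layout follows: it strips the spacing, computes a GC fraction, and translates codons to amino acids. The helper names are hypothetical. The codon table is the standard one, used here on the assumption that it is adequate for translation table 11, whose amino-acid assignments match the standard code and differ only in the permitted alternative start codons. Only the first and last lines of the gene sequence are included to keep the example short.

```python
from itertools import product

# First and last lines of the gene sequence, copied from this record.
FIRST_LINE = "ATGGAAAAAG GTTATTCAGA TAAAGCATAC CGTTGGAATC GGGAACATTA TTCTAGCAAA"
LAST_LINE = "TATTAA"

# Standard codon table built in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first = clean_sequence(FIRST_LINE)
print(len(first), translate(first), gc_fraction(first))
print(translate(clean_sequence(LAST_LINE)))
```

Applied to the full gene sequence, the same helpers can be compared against the listed protein sequence and GC content. The fragment above covers only 60 bases, so its GC fraction is not expected to equal the whole-gene figure.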