Gene Aazo_3686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3686 
Symbol 
ID9341491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3747865 
End bp3749775 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content38% 
IMG OID 
ProductDNA primase 
Protein accessionYP_003722364 
Protein GI298492187 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATCC CTCGCTTACA TCCAGATACC ATAGAAGAAG TTAAACACCG GGCTGATATT 
GTTGATCTTG TTTCAGAGTA CGTAGTTTTG CGTAAGCGTG GAAAGGTTTT TGTCGGGTTA
TGCCCCTTCC ATGATGAAAA AAGCCCTAGT TTCACGGTTA GTCCCAGCAA GCAGATGTAT
TATTGCTTCG GTTGTCAAGC TGCCGGTAAC GCCATTAAAT TTCTCATGGA TTTGGGTAAA
TACCAGTTTA CAGAAGTGGT GTTAGATTTA GCACGGCGTT ATCAAGTACC TGTCAAAACC
TTAGAACCGG AACAAAGACA AGAATTACAA CATCAGTTGT CTTTGCGTGA GCAGTTATAT
GAGGTTTTAG CGTCCACAGC CCAATTTTAT CAACACGCCC TCAGACAAAG TTTAGGACAA
AAAGGGATGC AGTATTTACA AGAACATCGT CAATTCAAAA CCGAAACAAT TCAACAATTT
GGTTTGGGTT ATGCACCCGC AGGTTGGGAA ACCTTACATC GGTATTTGGT GGAAGATAAA
CATTACCCAG TCCAGTTGGT GGAAAAAGCG GGTTTGATTA AACCCAGGAA AGATGGAGGC
GGTTATTATG ATGTATTTCG TGATCGCTTA ATGATTCCCA TCCGTGATAT ACAAGGACGA
GTTATTGCTT TTGGTGGGAG AACGCTGACA GAAGAACAAC CAAAATATTT AAACTCCCCA
GAAACAGAAC TTTTTAGTAA AGGTAAAACC TTATTTGCAT TAGATCACGC CAAAGATGGT
ATTTCTAAAT TAGATCAAGG AGTAGTTGTA GAGGGATATT TTGATGCGAT CGCTCTCCAT
GCAGCAGGGA TTAATAACGC CGTCGCTTCC CTGGGTACAG CTTTAAGCAT GGAACAAGTC
CGGTTGCTAT TACGCTACAC CGATTCAAAA CAATTAGTTC TAAACTTTGA TGCAGATAAA
GCCGGAATCA ACGCCGCAGA AAGGGCTATT GGTGAAATCG CGACATTAGC CTATAAAGGC
GAAGTGCAGT TAAAAATCCT TAATATACCC CATGCAAAAG ATGCCGATGA ATACTTGCAC
AGTCATACAG CAGAGGAATA TCAACAACTA TTAGCAAACG CCCCACTTTG GTTAAATTGG
CAAATTGCGG AAATCATCAA ACACAAAGAT TTAAGACAAG CTACTGCTTT TCAAAAAGTT
ACAAAAGAAA TCGTAAAACT CTTGCAAAAT ATAGTTAATA GTGATACACT AAATTATTAC
ATTTCCTACT GTGCAGAAAT ACTCAGCTTA GGGGACGCTA GATTAATACC CCTCAGAGTT
GAGAATTTAC TAACTCAAAT TGCACCCACT AGTGTGCAAA GTCTACCACT GCGCTCGCGC
AAACAGAAGT TGAAAACGCC GAAGCTATCT CTAGTAACCA CTGAACGGAG TTTATTAGAA
CAAGCAGAGG CGTTATTATT ACAAATATAT TTACATTGTC CAGAACAGCG CCAAGTAATA
ATTGATGAAC TAGAAGAGCG AAATTTAGAA TTTAGCCTTT CTCATCATCG CTTTTTTTGG
CAACAGAGTT TAGAGTTTCC AGTAGAAGAA GTAGATTTAA TTTTTAGTTT GGAAAACAAG
TATTTAGAAT TATCAGAAGA CTTAATATTA ATTTCTCATT TATTTCATTT GAATGAAAAA
ACCAACAAAG AAATACTGCG CACTCCCCAA GTACTTCAAG CCACATTTGC TTGTATGGAA
ATAGTCTTGA GAGAAAAACG CTATCGTTAT TTTATGGAAC TGTGGGAAAA AACTGATCCA
CTAACAGAAC CAGAGAAAGA TAAGTTTTAT GCTGATGCTA TGTATGCTGA AAAAATGCGT
TTACAAGAAT TAGATAAACA ACGGTATTTT TCAATTAGAG AATTACTTTA G
 
Protein sequence
MQIPRLHPDT IEEVKHRADI VDLVSEYVVL RKRGKVFVGL CPFHDEKSPS FTVSPSKQMY 
YCFGCQAAGN AIKFLMDLGK YQFTEVVLDL ARRYQVPVKT LEPEQRQELQ HQLSLREQLY
EVLASTAQFY QHALRQSLGQ KGMQYLQEHR QFKTETIQQF GLGYAPAGWE TLHRYLVEDK
HYPVQLVEKA GLIKPRKDGG GYYDVFRDRL MIPIRDIQGR VIAFGGRTLT EEQPKYLNSP
ETELFSKGKT LFALDHAKDG ISKLDQGVVV EGYFDAIALH AAGINNAVAS LGTALSMEQV
RLLLRYTDSK QLVLNFDADK AGINAAERAI GEIATLAYKG EVQLKILNIP HAKDADEYLH
SHTAEEYQQL LANAPLWLNW QIAEIIKHKD LRQATAFQKV TKEIVKLLQN IVNSDTLNYY
ISYCAEILSL GDARLIPLRV ENLLTQIAPT SVQSLPLRSR KQKLKTPKLS LVTTERSLLE
QAEALLLQIY LHCPEQRQVI IDELEERNLE FSLSHHRFFW QQSLEFPVEE VDLIFSLENK
YLELSEDLIL ISHLFHLNEK TNKEILRTPQ VLQATFACME IVLREKRYRY FMELWEKTDP
LTEPEKDKFY ADAMYAEKMR LQELDKQRYF SIRELL