Gene Aazo_1172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1172 
Symbol 
ID9338967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1254711 
End bp1256495 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content35% 
IMG OID 
Productfamily 1 extracellular solute-binding protein 
Protein accessionYP_003720617 
Protein GI298490440 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATTTA CTGAAAAAAT ACTCAGTTAC ACAAAACGCT TTTTCTTGCT TATCAGCCTC 
CTTTCGTTTA CAGCCCTGAC AATTACTGCT TGTAATCCAA CTAAACTAAA AACTAGCGCC
GCACAAGTAC CACAATTAGT AACCAGTATT CTCAGCGATC CGAAAACCTT TAACTATGCT
CTTAATTCTG ATGCAAATAA TATTTTTGGA TATACTTATG AAGGATTGCT TAACCAAAAT
CCTATCACTG GTAAACTTGA ACCAAATTTA GCAGAATCAT GGGAAATTTC TGAGGATAAA
TTAAAGATTA CTTTCACTCT GCGCAATAAT TTAAAATGGT CTGATGGACA ACCACTCACA
TCTGATGATG TTGTATTTAC TTATAATGAT ATTTATCTGA ATGAAGCCAT ACCAACAGAC
GTTAGAGATA TTTTGAGAAT TGGGAAGGAT GGCAAATTTC CCAGTATTAA AAAAATTGAT
AAAAGACGGG TAGAATTTAG CATACCAGAA CCTTTTAGAC CTTTTTTACA AAACTCTGGT
GTACCTATTT TACCTGCTCA TGCCTTACGA GAATCTTTAC AAACTAAAGA CCAAGACGGA
AAACCTAAAT TTCTGACAAC ATGGGGTATT GATACTCCAC CTGAACAAAT AATTGTTAAT
GGTCCCTTCA AGCTGGAACG TTACGATACT AGTCAGCGGG TAATTCTCCG GCGTAATCCC
TATTATTGGC GTAAAGATGC TCAAGGTAAA CTCCAACCTT ATATTGAACG TATTGTTTGG
CAAATTGTCG AATCAACGGA TACTTCTTTA TTACAATTTC GTTCTAGTGG TTTAGATGCT
GTGGGGGTAG CACCAGACTA TTTTTCTTTA TTAAAGGTAC AAGAAAAACA AGGTGATTTC
CAAATTTATA ATGGAGGACC TTCTACAAGT ACAAGTTTTG TGCTTTTTAA TTTGAATCAA
GGAAAAAGAA ACGGTAAATT ACTCATAGAC CCAATTAAAT CACGTTGGTT TAATAATGTA
GATTTTCGCC AAGCTGTAGC TTATGCAATT GATAGACGAA CCATGATTAA TAATACATTT
CGTGGTTTGG GTAAACCGCA AAATTCACCC ATTTCTGTGC AGAGTCCCTA TTACCTTTCT
CCAGAAGCAG GACTAAAAGT TTATAACTAT AATCCCGAAA AATCTAAGGA ATTATTACTC
AGATCTGGGT TTAAATACAA TGCTCAAAAT CAGCTAGAAG ATGCTCAAGG AAACCATGTT
AGGTTTGCTT TACTTACCAA TGCTGGTAAC AAGATTCGTG AAGCAATGGG TTCACAAATT
AAACAGGATT TGAGTAAAGT TGGTATCCAA GTTGATTTTA CTCCTTTAGC ATGGAATACT
TTTATAGATA AGCTATCTAA TACTTTAGAT TGGGAAGCTT CTTTACTCGG TTTGACTGGT
GGATTAGAAC CAAATGATGG GGCCAATGTA TGGTCTACTG AAGGTGGATT ACATATGTTT
AATCAAAAAC CCCAACCAGG ACAAAAACCA ATAGAAGGGT GGAAAGTTTC ACCATGGGAA
GCGAAGATTC ATGAATTTTA TATTCAAGGC GCACAGGAAC TTGATGAAGC AAAGGTGACA
GAAATTTATG CAGAAGTTCA ACGATTAACA CAGGAGAATT TACCATTTAT TTACTTAGTA
AATCCCTATT CTCTTTACGC AATCCGAAAC CGTTTTCAGG GAATTAGATT TTCTGCTTTG
GGTGGTCCAT TTTGGAACAT TCATGAAATT AAAATTACAA AATAG
 
Protein sequence
MTFTEKILSY TKRFFLLISL LSFTALTITA CNPTKLKTSA AQVPQLVTSI LSDPKTFNYA 
LNSDANNIFG YTYEGLLNQN PITGKLEPNL AESWEISEDK LKITFTLRNN LKWSDGQPLT
SDDVVFTYND IYLNEAIPTD VRDILRIGKD GKFPSIKKID KRRVEFSIPE PFRPFLQNSG
VPILPAHALR ESLQTKDQDG KPKFLTTWGI DTPPEQIIVN GPFKLERYDT SQRVILRRNP
YYWRKDAQGK LQPYIERIVW QIVESTDTSL LQFRSSGLDA VGVAPDYFSL LKVQEKQGDF
QIYNGGPSTS TSFVLFNLNQ GKRNGKLLID PIKSRWFNNV DFRQAVAYAI DRRTMINNTF
RGLGKPQNSP ISVQSPYYLS PEAGLKVYNY NPEKSKELLL RSGFKYNAQN QLEDAQGNHV
RFALLTNAGN KIREAMGSQI KQDLSKVGIQ VDFTPLAWNT FIDKLSNTLD WEASLLGLTG
GLEPNDGANV WSTEGGLHMF NQKPQPGQKP IEGWKVSPWE AKIHEFYIQG AQELDEAKVT
EIYAEVQRLT QENLPFIYLV NPYSLYAIRN RFQGIRFSAL GGPFWNIHEI KITK