Gene Aazo_1767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1767 
Symbol 
ID9339560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1829343 
End bp1830770 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content43% 
IMG OID 
Productphosphoglucosamine mutase 
Protein accessionYP_003721016 
Protein GI298490839 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAACC CTAGCAATTC AGGTAAGATA AAATTTGGTA CTGATGGATG GCGAGGTATT 
ATTGCCGATG ATTTTACTTT TCCCAATGTG CGGAAAGTAA CCAGGGCAAT AGCTAGCTAT
CTGGAAACTG CCTATAGCAA GGACAGACCA GTTTTAATAG CCTACGATAC ACGGTTTTTA
GCTGACGAGT TTGCCCGTAC ATCTGCCGCA GTGCTGGCAG ATTTGGGTTG GAATGTGAAA
ATTACTGATC GAGATTGTCC CACACCAGTA ATTGCCTACA ATGCGCGTCA CCTCAATTCC
GCAGGGGCGT TGATGTTTAC TGCTAGTCAT AATCCTGCAC CCTATTGTGG GATTAAATAT
ATTCCAGACT ATGCTGGGCC TGCTACTCCA GAAATTACTG ATACTATTGT GGCAAATATA
GAAACTGCAT CCGATGAGTT GCCCGGAAGT AACCCATCAG GTACTATTTC TATTTTTGAT
CCTAAGCCCG ATTATTTGCA TTTCATCTAC ACCTTGCTAG ATGTGGAAAA AATTAAAGGT
GCGAATTTAA AGGTTAAGTA CGATGCTCTC TATTCTACCT CTCGTGGCTA TTTAGATGAG
GTGTTGGAAC ATTGCGGTAC TCAGTTAGAA AGTTTCCATA TTTGGAGGGA TGTACTTTTT
GGCGGTGGTA TGCCTGAACC CAAGGGGGAA CAGTTAGTTG AGTTGGTGGA AGCTGTTAAG
ACTGATAAGG CTGATTTGGG TTTGGCAACT GATGGGGATA GTGATCGCTT TGGTATTGTT
GATGAACAAG GTAATGTTCT TACTCCTAAT ACAGTGCTGT TAGTTTTAGC ACGGCATTTA
ATAAAAAACA AGGGTAAAAC TGGCGCAATT GTTCGCACTG TGGCGACTAC TCATTTGTTA
GATAATTTTG CGGCTAAATA TGGTTTGCAA ATTTATGAAA CTGCTGTTGG TTTTAAATAC
ATTGGTGAAA AAATGCGGGA AACCGCTGTT TTGATTGGTG GTGAGGAATC TGGTGGTTTA
AGTATTATTG GCCATATTCC TGAAAAGGAT GGTGTTCTAG CCGATATGCT GGTGGCTGAG
GCGATCGCTT ATGAAGGTAA GCCTTTGAGT CAGTTGGTGC AGGAAGCGAT CACAGAAGCT
GATGGTCCTT TGTATAACAA CCGCTTGGAT TTACACCTTA CTGAGTCTCA CAAAAATGCT
GTGATTGATT CCTTCACCTT AATGCCTCCT TCCGAAGTAG CAGGAATTAA GGTGAAGGAG
GTGGGACGTA AGGATGGCAT TAAGCTGTAT TTGGAAGAGG GTAGTTGGGT ATTACTGCGT
CCTTCTGGTA CAGAACCTTT GGTGAGGGTG TACATGGAAA CTAATTCACC GGAAAAACTT
AGCCAGATTG CAGCGACTAT GGAAGCTGAA ATTGCTAAGT TAGCATAA
 
Protein sequence
MSNPSNSGKI KFGTDGWRGI IADDFTFPNV RKVTRAIASY LETAYSKDRP VLIAYDTRFL 
ADEFARTSAA VLADLGWNVK ITDRDCPTPV IAYNARHLNS AGALMFTASH NPAPYCGIKY
IPDYAGPATP EITDTIVANI ETASDELPGS NPSGTISIFD PKPDYLHFIY TLLDVEKIKG
ANLKVKYDAL YSTSRGYLDE VLEHCGTQLE SFHIWRDVLF GGGMPEPKGE QLVELVEAVK
TDKADLGLAT DGDSDRFGIV DEQGNVLTPN TVLLVLARHL IKNKGKTGAI VRTVATTHLL
DNFAAKYGLQ IYETAVGFKY IGEKMRETAV LIGGEESGGL SIIGHIPEKD GVLADMLVAE
AIAYEGKPLS QLVQEAITEA DGPLYNNRLD LHLTESHKNA VIDSFTLMPP SEVAGIKVKE
VGRKDGIKLY LEEGSWVLLR PSGTEPLVRV YMETNSPEKL SQIAATMEAE IAKLA