Gene Aazo_0470 details


Gene Information

Locus tag: Aazo_0470
Symbol:
ID: 9338255
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 484921
End bp: 486207
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 44%
IMG OID:
Product: major facilitator superfamily protein
Protein accession: YP_003720127
Protein GI: 298489950
COG category:
COG ID:
TIGRFAM ID:
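
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short Python sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes a terminal stop codon; neither convention is stated in the record. The names span_length and coding_length_aa are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 484921
END_BP = 486207
GENE_LENGTH_BP = 1287
PROTEIN_LENGTH_AA = 428


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# 486207 - 484921 + 1 = 1287 bp
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
# 1287 / 3 = 429 codons, minus one stop codon = 428 aa
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under those two assumptions the reported gene length and protein length are consistent with the coordinates.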


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.807635
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACAG TCAAATCTCT ACTGCAAGTT TTCGGTAGCC GGAAGATGGC AGCTTTGATA 
CTACTCGGTT TTTCATCTGG GTTGCCCTTG TTTTTGACTA GCAAGACCTT ACAAGCTTGG
ATGACAGTTG AAAATGTCGA TTTAACCGCC ATCGGGTTAT TTAGCCTTGT AGGTGTACCA
TACTCCTTAA AATTTCTCTG GTCGCCTCTG TTAGACTGGT TTACATTGCC ATTTTTAGGA
AGGCGACGGG GTTGGTTAAT CGCAATTCAA ATTGGGTTAC TAATAGCGCT CGCTTGCATG
GCACTGCAAC AGCCCAAACA AGCCCTACAA CTGTTAGCCA TAAACGCCGT TGCGATCGCA
TTCCTCAGCG CCACCCAAGA CATAGCTGCT GATGCTTACC GCACCGACAT TCTTGAACAA
CTAGAAATGG GCGCAGGTGC AGCAGTATTC GTCTTAGGAT ATCGTATCGC CCTACTACTC
ACAGGCTCCT TAGCCTTGAT TCTCGCCGAT ATAATTCCCT GGTCTTCCGT ATACTTATTA
ATGGCAGTCG GCATGGTAGT AGGCATAATT GCCACCGTAT TTGCACCAGA ACCCAAAGAA
ATCAGTCCAC CAGAATCCTT AAGCGCAGCC GTCATTCTCC CCTTTAGGGA ATTTATTCAA
CGTCAAGGTG TAGTTCAAGC CATACTAACT CTGTTGTTTA TAGTCCTTTA TAAACTCGGC
GATTCCTTTG TCAACAATAT GTCCACCTCA TTTTTACTAA AAACAGGCTT CAGCCAAACC
GACATTGGCG CAATTCAAGG CGGCATGGGA CTGATAGCAA CCATAGTTGG CATACTGGCA
GGTGGTGCAT TTTTGAGTAA AATTGGACTG AACCGCTCAC TTTGGCTATT TGGTGCCTTG
CAAGCAGTCA GCAACTTAGC TTACCTTTTA CTTGCACAAG TTGGTAAAAA CTATCAGGTT
CTCCTACTGA CAATTAACAT AGAAAACTTT TGTGCTGGCT TAGGAACAGC AGCCTTTGTT
GCCTTTTTAA TGAATATGTG TAATCAGCGT TATTCCGCAA CTCAATATGC TTTACTTTCT
AGTTTTATGG CCGTAAGTCG TGATATTCTA GTTGCGCCAG CAGGTTCTTT AGCAAAAAGC
ACAGGTTGGC CTTTATTTTT TGTCATTAGT ATCGTTGCTG CTATACCAGG ACTACTCCTA
TTACCATTAT TTGCTCCCTG GAACTCAAAA CCATTACCAC TCAAAAGACC AGGAATTGAA
GAAGAGGATA TATGGGGAAC CAAGTAG
 
Protein sequence
MNTVKSLLQV FGSRKMAALI LLGFSSGLPL FLTSKTLQAW MTVENVDLTA IGLFSLVGVP 
YSLKFLWSPL LDWFTLPFLG RRRGWLIAIQ IGLLIALACM ALQQPKQALQ LLAINAVAIA
FLSATQDIAA DAYRTDILEQ LEMGAGAAVF VLGYRIALLL TGSLALILAD IIPWSSVYLL
MAVGMVVGII ATVFAPEPKE ISPPESLSAA VILPFREFIQ RQGVVQAILT LLFIVLYKLG
DSFVNNMSTS FLLKTGFSQT DIGAIQGGMG LIATIVGILA GGAFLSKIGL NRSLWLFGAL
QAVSNLAYLL LAQVGKNYQV LLLTINIENF CAGLGTAAFV AFLMNMCNQR YSATQYALLS
SFMAVSRDIL VAPAGSLAKS TGWPLFFVIS IVAAIPGLLL LPLFAPWNSK PLPLKRPGIE
EEDIWGTK
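
The sequences above are printed in space-separated blocks of ten characters, so they have to be joined before any analysis. The Python sketch below shows one way to do that and to compute GC content and a translation. The function names clean, gc_content and translate are hypothetical, and the codon table is built by hand instead of coming from a library. Translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), which is why a standard codon layout is used here. The example uses only the first line of the gene sequence.

```python
# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon.

    Only A, C, G and T are handled; any other symbol raises KeyError.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAATACAG TCAAATCTCT ACTGCAAGTT TTCGGTAGCC GGAAGATGGC AGCTTTGATA"
print(translate(first_line))  # MNTVKSLLQVFGSRKMAALI
```

The 20 residues printed match the first two blocks of the protein sequence listed above. Passing the full gene sequence to gc_content would give a value to compare with the GC content field.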