Gene Aazo_4751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4751 
Symbol 
ID9342558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4855683 
End bp4856948 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content43% 
IMG OID 
Productmajor facilitator superfamily protein 
Protein accessionYP_003723061 
Protein GI298492884 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.092373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTCGA GCAAACAGAA ATCAACTCAC CATAACAGCA AGGCTGCACA ACACGATCCT 
TTAGCAGCCA TGAGATTTCG AGATTATCGG CTATTTACCA TTGGGCGTGT ACTCCTCTTC
ACCGGGGGAC AAATGCAGAC CGTGGCGCTG GGTTGGGAGC TTTATGAGCG GACAAATTCA
GCGATAGTAT TAGGTGGAAT AGGACTGGCG CAAGTTCTAC CAATGATTGC ACTAACTTTG
ATTACAGGAC ATATTGCTGA TAAGAGCGAT CGCAAACGCA TTACTTTATT CTCAATTTTG
CTGCTAACTC TTTGCTCAAT AGCTTTAGCA GTTATTTCCT TTAATGAAGG TGCAATTTTT
CTAGTTTATG GTTGCTTATT ATTAACAGGT GTAGCCAGAG CATTTCTCAA ACCTGCCGGT
GATGCACTGA TGTGGCAGTT AATACCCACG AGTGCTTTTA CCAATGCAGC AACTTGGAAT
AGCAGTAGCT TTCAATTAGC ATCAGTAATT GGGCCAGCTT TGGGAGGATT CAGCGTTGTT
CTTTTCGGAA ATGCGACAGG GGTATATATA TTAGCCACAT TGGCAGCACT ATCATGTTTT
TTCCTCACAG CCGCAATTAA ACCACAAAAA ACTAACTTTG CCAAAGAACC AACATCTTTA
AAAACTCTAG CTGCTGGTGC CGAATTTGTT TGGAATAATC AACTAATTCT TGCAGCCATC
ACCCTCGATT TATTTGCAGT CTTGTTAGGT GGTGCAGTTG CATTATTACC CATCTTTGCC
AAAGATATTT TGCAAGTTGG TCCGGTAGAA TTAGGCTATT TACAAGCAGC CCCATCCATA
GGTGCATTGA TTATGGCAGC ATTGTTGGTA TATTTGCCAC CTATCCACAA AGCCGGCCCT
GCCTTACTTT GGTCAGTTGT CGGGTTTGGG ATTGTGACAA TTATTTTTGG GTTATCCCGT
TGGGTCTGGC TATCGTTGTT GATGTTGGCA TTAAGTGGGG CGTTAGACAG CATTAGCGTG
GTGATTCGTC ATACTTTGGT GCAAATTCGC ACTCCTGACC ATTTGCGGGG TCGAGTTGCA
GCTATAAATA GTGTATTTAT CAGTGCTTCC AATGAATTGG GAGGTTTTGA ATCTGGTTTG
ACTGCGGCTT TATTTGGGCC TGTTTTGTCT GTCGTTGGTG GAGGTGTGGG GACGATTTTA
GTAGTGGTAG CAACAGCCAT GATTTGGCCG GAAATTCGCA AATTAGGAGC TTTGCATGAG
GATTAA
 
Protein sequence
MSSSKQKSTH HNSKAAQHDP LAAMRFRDYR LFTIGRVLLF TGGQMQTVAL GWELYERTNS 
AIVLGGIGLA QVLPMIALTL ITGHIADKSD RKRITLFSIL LLTLCSIALA VISFNEGAIF
LVYGCLLLTG VARAFLKPAG DALMWQLIPT SAFTNAATWN SSSFQLASVI GPALGGFSVV
LFGNATGVYI LATLAALSCF FLTAAIKPQK TNFAKEPTSL KTLAAGAEFV WNNQLILAAI
TLDLFAVLLG GAVALLPIFA KDILQVGPVE LGYLQAAPSI GALIMAALLV YLPPIHKAGP
ALLWSVVGFG IVTIIFGLSR WVWLSLLMLA LSGALDSISV VIRHTLVQIR TPDHLRGRVA
AINSVFISAS NELGGFESGL TAALFGPVLS VVGGGVGTIL VVVATAMIWP EIRKLGALHE
D