Gene Aazo_4040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4040 
Symbol 
ID9341845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4099059 
End bp4100714 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content40% 
IMG OID 
Productputative sodium symporter protein 
Protein accessionYP_003722628 
Protein GI298492451 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.824873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCAGTTG AACTTTGGAC AATTATCTTA GTTGGACTTT CTTTTGCACT CTATATTTAC 
ATTGGTTGGC AATCACGGGT AAAGGATACA AAAGATTTCT TTATTGCTGG TCAGGGTATT
CCTTCCATTG CTAACGGTGC AGCAACAGCA GCTGATTGGA TGTCTGCGGC TTCGTTTATT
TCCATGGCTG GGCTAATTTC TACTTTGGGT TATGATGGTT CAATTTATTT GATGGGTTGG
ACTGGTGGCT ATGTTTTGTT AGCTTTATTA TTAGCCCCAT ATTTGCGGAA ATTTGGTAAA
TATACAGTTC CCGATTTTGT AGGCGAGCGC TATAAATCTA ATCTAGCTCG TTTAGTAGCA
GTAGTTGCAG CTATTTTCGT TTCTCTTACC TACGTTGCTG GCCAGATGCG CGGTGTAGGT
ATTGTCTTTA GCCGCTTTTT AGAAGTTGAT ATCAATACTG GTGTCATTAT CGGTATAGTA
ATAGTCGGCT TTCTTGCAGT GTTGGGAGGA ATGAAGGGTA TTACTTGGAC ACAAGTTGCC
CAATATTGCA TCTTAATTTT CGCTTATTTG ATTCCTGCCA TTGCGCTCGC CTTTATTCTC
ACGGGCAATC CTGTTCCTCA GTTAGCATTT ACCTTTAGTG ATGTTGCTGA TAACTTAAAT
AAAATTCAGA CCGATTTAGG GTTTGCACAA TATACCCAAC CTTTTGCTAA CAAAACCATG
ATGGATGTCC TATTCATCAC TATTGCTTTG ATGGTAGGAA CTGCGGGTTT ACCCCACATT
ATCGTCCGGT TTTACACAGT ACCTAATGTG CGTGCAGCGA GATTTTCCGC AGGTTGGGCA
TTATTATTTA TCGCCATTCT TTACACAACT GCCCCGGCTT TATCCATGTT TGCCCGCTAC
AATTTAATTC AATCTCTCCA CAACCATACA GTTGAAGAGG TTAGACAATT AGACTGGGCA
AATAAATGGG AAAAAACCAA ACTCCTCACT TTTGAAGATA AAAACAAAGA TGGTAAATTA
CAGTTAACTA GCAAAAAAGA AACTAACGAA ATTACCATCG ACAATGATAT TATTGTTCTC
TCTACCCCAG AAGTTGCTAA ACTCGCACCT TGGGTCATAG CTTTAGTGGC AGCGGGAGGT
TTAGCAGCAG CATTGTCAAC AGCTTCTGGT TTATTACTAG TAATTTCTAG TTCTATTGCC
CATGACGTTT ATTATCGCAT CTTCGATTCT ACAGCTTCCG AAGAAAAACG AGTATTTGTA
GGCCGAACAG TTGTCGGTTT TGCCTTAGTT CTTGCAGGTT ATTTTGGCGT AAACCCCCCT
GGTTTTGTGT CTCAGGTCGT AGCTTTTGCC TTTGGTTTAG CTGCTGCTAG TTTCTTCCCA
GTGATAGTTT TAGGAATTTT TGATAAACGC ACAAATGCCG AAGGTGCTAT TGCGGGAATG
TTAACCGGTT TCATTTTCAC TATCATCTAT ATTATCGGTG TGAAATTTAC GGGAATGACA
CCTTGGTTTT TTGGAGTTTC TGCTGAAGGT ATCGGCACCT TAGGGATGAT CATTAATTTT
ATTGTCACCA TTACAGTTTC CCGTTGTACT CCACCACCAG GAGCAGATAT TCAAGCTTTA
GTTGAAGATT TACGTACTCC TAGTTTTGAA GAATAG
 
Protein sequence
MSVELWTIIL VGLSFALYIY IGWQSRVKDT KDFFIAGQGI PSIANGAATA ADWMSAASFI 
SMAGLISTLG YDGSIYLMGW TGGYVLLALL LAPYLRKFGK YTVPDFVGER YKSNLARLVA
VVAAIFVSLT YVAGQMRGVG IVFSRFLEVD INTGVIIGIV IVGFLAVLGG MKGITWTQVA
QYCILIFAYL IPAIALAFIL TGNPVPQLAF TFSDVADNLN KIQTDLGFAQ YTQPFANKTM
MDVLFITIAL MVGTAGLPHI IVRFYTVPNV RAARFSAGWA LLFIAILYTT APALSMFARY
NLIQSLHNHT VEEVRQLDWA NKWEKTKLLT FEDKNKDGKL QLTSKKETNE ITIDNDIIVL
STPEVAKLAP WVIALVAAGG LAAALSTASG LLLVISSSIA HDVYYRIFDS TASEEKRVFV
GRTVVGFALV LAGYFGVNPP GFVSQVVAFA FGLAAASFFP VIVLGIFDKR TNAEGAIAGM
LTGFIFTIIY IIGVKFTGMT PWFFGVSAEG IGTLGMIINF IVTITVSRCT PPPGADIQAL
VEDLRTPSFE E