Gene Aazo_2540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2540 
Symbol 
ID9340339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2643450 
End bp2645093 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content43% 
IMG OID 
Productmajor facilitator superfamily protein 
Protein accessionYP_003721559 
Protein GI298491382 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.824873 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATCGT CTGATTTGGA TAGAAAAGTC CCGCCTCTAT CACCTAGCCA AGCTCAAAAC 
AAGATTAGGG TATCAGAATC TACAAGACTT AATCATTTAA ATTCTGTTCC CCAGTCTCAG
CCTGGCCAAA TTAATAAGCA AGAAATGTCT GTCAAAGAAG TTGCTAATGA TCAAAAAGGG
ACAACAGAAG TTAATTTAAC TGATATCCCT AAACCATCGG CACTGCATTC TGACATCGCC
CTGACAATTA CTGAACAAAA TGGCAAAGTT AATAGAACAC AACAGACTTT GCCAGAAGCG
ATTTCAACAG AAACATCACA ATTAAATGGC TCTGGTTCGG GTGGAGAAAC GGCATCACGA
GATGTAGAAA AACAAGGATT TTTTCCTGTC CTGAAAAACC CGAATTTTCT AGCTCTTTGG
GGTGGTCAAA TTTTCTGTCA ACTGGCGGAT AAAGTATATC TGGTGCTGAT GATTGCTCTG
ATTAATAGCG AGTTTCAACA GGGTGGTCAA AGCATTAGTG GTTGGGTATC GGCGTTAATG
ATGATTTTTA CCATTCCCGC AGTGCTGTTT GGTTCAGTCG CTGGTGTGTT TGTAGATCGC
TGGTCAAAAA AAGTTGTGCT GGTGGCATCG AATATTTGGC GCGGTATTCT AGTTTTAGCC
ATTCCTTTTT TACTTTGGTT AACCTATGAT TGGCAACCTG TAGGAGTTTT GCCGGTGGGT
TTTCTGATGA TTTTGGCAGT AACTTTTTTG GTTTCTACGT TGACACAGTT TTTTGCACCA
GCAGAACAGG CTGCTATTCC TTTGGTGGTG GAAGAACAGC ATTTACTTTC TGCTAATTCC
CTGTACACAA CTACGATGAT GGCATCGGTA ATTGTCGGTT TTGCTCTGGG GGAACCAGTT
TTGGTATTAG CAGATGGAAT TTGGTCACAA TTCGGTGGTA GTGGAGGACT GGGTAAAGAA
ATTTTGGTTG GTGGTAGTTA TGCGATCGCC GGAATTATTT TATTACTGCT CAGAACTAAC
GAAAAACCCA ACCCCCCAGA AACAGAATTC CCTCATGTTT TCTCTGATCT GCGCGATGGT
TTGCGTTATC TCCAAGAAAA TCAGCGTGTC CGCAATGCTT TATTACAACT AATTATTTTA
TTTTCTGTCT TTGCAGCCTT AACCGTACTC GCGGTTCGCA TGGCAGAAAT TATCCCCAAT
CTCAAAGCTT CCCAATTCGG CTTTTTACTC GCATCTGGTG GTGTTGGTAT CGCCGGTGGT
GCAACCATTC TCGGTCAATT TGGACAACGC TTTTCCTATA GGCAACTTAG TCTGTGGGGT
TGTCTCGGCA TGGCAGCATC TTTATTCGGT CTTTCCATCT TCACAACCCA GCTAGGTGCA
GTGCTGCTAT TACTAGCTTT AGTTGGTGTA TTTGGTGCTT TGGTGGGTAT CCCAATGCAA
ACGGCTATTC AAACCGAAAC ACCCCCAGAA ATGCGCGGCA AAGTGTTTGG CCTGCAAAAT
AATGTGATTA ATATTGCCCT CACCCTACCC CTAGCATTAG CAGGTGTAGC CGAAACCTTT
CTTGGCTTAC AGTCAGTCTT TTTGGCATTA GCTATCATCG TTTTCTTGGG AGGTATCTTA
ACCTGGTACA ATTCCCGTGG TTAG
 
Protein sequence
MQSSDLDRKV PPLSPSQAQN KIRVSESTRL NHLNSVPQSQ PGQINKQEMS VKEVANDQKG 
TTEVNLTDIP KPSALHSDIA LTITEQNGKV NRTQQTLPEA ISTETSQLNG SGSGGETASR
DVEKQGFFPV LKNPNFLALW GGQIFCQLAD KVYLVLMIAL INSEFQQGGQ SISGWVSALM
MIFTIPAVLF GSVAGVFVDR WSKKVVLVAS NIWRGILVLA IPFLLWLTYD WQPVGVLPVG
FLMILAVTFL VSTLTQFFAP AEQAAIPLVV EEQHLLSANS LYTTTMMASV IVGFALGEPV
LVLADGIWSQ FGGSGGLGKE ILVGGSYAIA GIILLLLRTN EKPNPPETEF PHVFSDLRDG
LRYLQENQRV RNALLQLIIL FSVFAALTVL AVRMAEIIPN LKASQFGFLL ASGGVGIAGG
ATILGQFGQR FSYRQLSLWG CLGMAASLFG LSIFTTQLGA VLLLLALVGV FGALVGIPMQ
TAIQTETPPE MRGKVFGLQN NVINIALTLP LALAGVAETF LGLQSVFLAL AIIVFLGGIL
TWYNSRG