Gene Aazo_3035 details

Gene Information

Locus tag: Aazo_3035
Symbol:
ID: 9340838
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 3120924
End bp: 3122804
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 38%
IMG OID:
Product: ABC transporter-like protein protein
Protein accession: YP_003721935
Protein GI: 298491758
COG category:
COG ID:
TIGRFAM ID:
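
The coordinate and length fields above are mutually consistent, which can be checked with a short Python sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus the stop codon, for an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3120924, 3122804)
print(length)                     # 1881
print(protein_length_aa(length))  # 626
```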


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGTCT ATCCAGACCA GGAAAAATCC CACAACAAAT ACTCTCGCAA TGATAATGAT 
TGGCGGTTAT TTTTACGTCT AGTACCCTAT GCCCGTCGCA ATAGGGAGTT ACTGGCGCTA
TCAATGTGTT TACTAGTACC CATTGCTGTT GCTAACTCCG TCCAACCGTT GTTAATTGGA
CAGGTTATAT CTCTGATTCG TGAGGAACCG AGTACCTATG AATTTCTTAA AAACCGCTCC
CTATGGGAAG GGTTAAACAT CCTGCAAGGA TTATTTCTGG CTACAATTAT CGTCAGACTA
TCACTAACAG GGCTTCAGGG TTATCTAGTA CAAAAGCTAG GACAAAAAAT CACTGCTGCA
ATTCGTGAAG ATTTATTCCA TCATGTCACC TCTTTAGCAG TGCGTTTCTT TGAACGTACA
CCTGTAGGTA AACTTATTAC CAGACTCATC AATGATGTTG AAAGTTTAGG AGATGTCTTT
GCTACTGGTG CTATTGGCAT TGTCTCGGAT TTATTTTCCA TGTTGATGAT TATTGGTTTA
ATGTTTTCTA TTCAATGGCA ATTGGCTTGT TTACTACTAT TAATTCTGTT ACCAATCACT
TCGGTAATTA TTTACTTTCA GCAGCAGTAT CGCCGTGCTA ATTACAAAGC AAGAGAAGAA
CTATCAAAAC TAAATTCCCA ATTGCAAGAA AATATCGTTG GCATTAATGT AGTGCAGTTG
TTCCGCAGAG AAAAATTCAA TGCAGAATTA TTTCGCGCTA CGAATAACAT TTATGTTAAA
GAAGTTGATG ATACCATTTG GTATGATTCC GCTGTTTCTG CGACCTTAGA ATGGATTAGT
CTGATTGCAA TCGCAGGGGT GTTATGGGTA GGTGGTTTGT TGCTATTAGG ACAAAATATC
ACCTTTGGAA TTTTATCAGC ATTTATTTTG TATGCCCAAC AATTATTTGA CCCTCTGCGG
AACTTTGCTG AAAAATTCAC TGTTATTCAA GCTGGTTTCA CTGCCATTGA AAGGGTCAAC
GATATTTTAG ATGAACCGAT AGAGATCAGA GATCAAACTA ACCCTCATTT TTCAATTTTA
GATACTCAAT TTGGTTATAT AGATGAGATA ATTAGGGATT TAGAAAACCA GGATTTTACT
ACTCCACCTG ATTTGGGAGA AATCCGGTTT GAGAATGTTT GGTTTGCTTA CAAAGATGAT
GATTATGTAA TTAAGAATTT ACACTTTACT ATTCATCCAG GTGAAAAAAT AGCCTTAGTC
GGTCCTACAG GTGCGGGCAA AAGTTCTATC ATCAGGCTTT TATGTCGTCT TTATGAACCG
ACTCAAGGAC GCATTCTGAT TGATGGTATA GATCTCCGTG AGATACCACA GTCAGAATTG
CGGCGTTATA TGGCAGTAAT TTTGCAAGAA GCTTTTTTGT TTGCTGGTGA TGTCAAAAGT
AATATTACTT TAGGAGATAG CTACACTTTT GAAGAAATTG AGAAAGCAGC AGCAAAAACT
AATGTAGCTG ATTTTATCCG TCAATTACCT CAAGGCTATG AGACCCAATT ACGAGAGCGG
GGAAAAAATA TTTCTAGTGG ACAAAAGCAA CTGTTAGCAT TTGCTCGTGC TGCTATTCGT
AATCCGCAAA TTTTAGTATT GGATGAAGCT ACTGCTAGTT TAGATGTAGG CACAGAAGCT
TTAATTCAAC AGGCATTAAA TCAGCTTTTA ATCAAACGAA CTGCAATAAT AATTGCTCAT
CGCTTGTCAA CAATCCGCAA TGTAGACCGG ATTTTTGTCT TGAATCGGGG GGAAGTAATT
GAACACGGAA CTCATGAACA ATTACTAGGA CAAGGAGGAC TTTACGCAAC GTTGCATAAT
TTACAAATGT TGGGGATTTG A
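
The gene sequence is printed in blocks of ten bases, so it has to be joined before any analysis. The following Python sketch shows one way to strip the spacing and compute a GC fraction. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTTGTCT ATCCAGACCA GGAAAAATCC CACAACAAAT ACTCTCGCAA TGATAATGAT"
seq = clean_sequence(first_line)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(gc_fraction(seq))       # GC fraction of this first line only
```

Passing the full sequence to `gc_fraction` gives a value that can be compared with the GC content field in the record.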
 
Protein sequence
MVVYPDQEKS HNKYSRNDND WRLFLRLVPY ARRNRELLAL SMCLLVPIAV ANSVQPLLIG 
QVISLIREEP STYEFLKNRS LWEGLNILQG LFLATIIVRL SLTGLQGYLV QKLGQKITAA
IREDLFHHVT SLAVRFFERT PVGKLITRLI NDVESLGDVF ATGAIGIVSD LFSMLMIIGL
MFSIQWQLAC LLLLILLPIT SVIIYFQQQY RRANYKAREE LSKLNSQLQE NIVGINVVQL
FRREKFNAEL FRATNNIYVK EVDDTIWYDS AVSATLEWIS LIAIAGVLWV GGLLLLGQNI
TFGILSAFIL YAQQLFDPLR NFAEKFTVIQ AGFTAIERVN DILDEPIEIR DQTNPHFSIL
DTQFGYIDEI IRDLENQDFT TPPDLGEIRF ENVWFAYKDD DYVIKNLHFT IHPGEKIALV
GPTGAGKSSI IRLLCRLYEP TQGRILIDGI DLREIPQSEL RRYMAVILQE AFLFAGDVKS
NITLGDSYTF EEIEKAAAKT NVADFIRQLP QGYETQLRER GKNISSGQKQ LLAFARAAIR
NPQILVLDEA TASLDVGTEA LIQQALNQLL IKRTAIIIAH RLSTIRNVDR IFVLNRGEVI
EHGTHEQLLG QGGLYATLHN LQMLGI
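
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a minimal sketch, the Python code below builds a codon table from the standard codon ordering and translates a block-formatted fragment. Table 11 shares its amino acid assignments with the standard code; its alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# Codons in TCAG order, paired with the amino acids of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGGTTGTCT ATCCAGACCA GGAAAAATCC"))  # MVVYPDQEKS
print(translate("TTGGGGATTT GA"))                     # LGI
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.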