Gene Aazo_2301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2301 
Symbol 
ID9340101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2387998 
End bp2389749 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content41% 
IMG OID 
ProductABC transporter-like protein protein 
Protein accessionYP_003721391 
Protein GI298491214 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACAAT CTCGACGGCT TGCTAAACTC GGTACTTACC TACGTCCCCA TTGGCAGGAC 
ACTGCTTTGG GAATTATTGC TTTATTCTCT GTCAATGGGC TGGGCGTTTA TATCCCTTTG
CTGATTCGCT CTGGGGTGGA TACACTTTCA ACAACCTTTA GTTTGAATCA AGTAATACGT
TATGCGGTCA TAATTATCTC GCTTAGTTCC GCAATGTGGA TGATGCGTAT GGCTTCCCGC
ATCTGGATTT TTGGTGTGGG TCGTCAGGTA GAATTTGAAC TCAAGCAGCG AATTTTTCAG
CATTTGCTAA AACTTGAACC GGCTTATTTT ACGACCAATA CTCCTGGTGA TTTAATTAGT
CGGGCTACTA GCGATGTGGA AAATATCAGG CGGTTGCTAG GTTTTGCGGT ATTGAGTTTA
GCAAATACTT TGTTTGCTTA TGCTCTGACG GTGCCAGTGA TGCTATCAAT TAGTGTGGAT
CTGACATTAG CTTCTCTGGC AGTCTATCCA TTTATGTTCT TGTTGGTGTA TTTGTTTAGC
GATCGCTTAC GTCAACAACA AGCAGCAGTG CAAGAACAAC TTTCGGATAT GAGTGAACTG
ATTCAAGAAG ATATCAGTGG CATTTCTTTA ATTAAGATTT ATGCTCAAGA AGAGAATGAG
CGTCAAGCCT TCCAAAAAAA GAATCAGGAT CTATTGACAG CTAACCTAAT ATTGGCACGA
ACCAGAAATA CGCTTTTTCC CCTAATTGGT GGTTTGGCTA ATATCAGTTC TCTGATCATC
ATTTGGTTGG GAACGATGCG GATATCTTCG GATTCTTTAC CTGTGGGTGA TTTTTTGGCT
TTATTAATTT ATGTAGAGCG TCTAGTTTTT CCCACGGCCT TGCTAGGATT TACAATTACT
GCGTATCAAC GGGGTGAAGT AAGTATTGAC AGACTAGAGT CAATTTTGAG TGTAACTCCA
AAAATTCAAG ATCCACCGGA TGCTATCCAT TTGGCAGCCA ATACAATCAA AGGGGAAGTA
ACAGCTAAAA ATCTTAGTTA CACCTACCCT GGTGCCAATA TCCCCGCTTT AAATGATATT
AACTTGACTA TTGCTCCTGG GGAACTAGTG GCAATTGTTG GGGCTATTGG TGCTGGAAAA
TCTACTTTGG CTAATACTTT GCCTCGATTG TTAGATATTA AAGCGGGGCA GTTGTTTTTG
GATGGTTTGG ATATTACAAA AATAGCTTTA AATGATTTGC GGCGTGCGAT CGCTTACGTA
CCTCAAGACA GCTTTTTATT CAGCACTACC ATTCAAAATA ATATCCGCTA CGGGGACCCA
GTCCGTAAAC AAGAAAACGT GGTAACAGTC GCCCGTCTGG CTCAAATGGA AGCAGAAATT
AACAATTTTC CCCAGCAATA TGAAACAATT GTCGGTGAAA GAGGAATAAC TCTTTCAGGT
GGACAGCGAC AACGTACAGC TTTAGCTAGA GCAATGTTAA TTGATGCTCC AGTTTTAATT
TTAGATGATG CCCTTTCCAG CGTTGATAAT CAAACAGCTA CCCAAATTCT CAGAAATCTC
TCTAGTGGTA CACAACAGAA AACCGTGATT TTCATCACCC ATCAACTTTC TGCTGCTGCT
ACTGCGGATA GAATTATGGT CATGGACAAG GGTGAAATTG TCCAAATTGG TAAACATACA
GAACTTGTTG AACAGCCAGG ATTATATAAA AAGTTGTGGA GTCAGCATCA AGTACAAGAA
TTGCTTAAAT AG
 
Protein sequence
MAQSRRLAKL GTYLRPHWQD TALGIIALFS VNGLGVYIPL LIRSGVDTLS TTFSLNQVIR 
YAVIIISLSS AMWMMRMASR IWIFGVGRQV EFELKQRIFQ HLLKLEPAYF TTNTPGDLIS
RATSDVENIR RLLGFAVLSL ANTLFAYALT VPVMLSISVD LTLASLAVYP FMFLLVYLFS
DRLRQQQAAV QEQLSDMSEL IQEDISGISL IKIYAQEENE RQAFQKKNQD LLTANLILAR
TRNTLFPLIG GLANISSLII IWLGTMRISS DSLPVGDFLA LLIYVERLVF PTALLGFTIT
AYQRGEVSID RLESILSVTP KIQDPPDAIH LAANTIKGEV TAKNLSYTYP GANIPALNDI
NLTIAPGELV AIVGAIGAGK STLANTLPRL LDIKAGQLFL DGLDITKIAL NDLRRAIAYV
PQDSFLFSTT IQNNIRYGDP VRKQENVVTV ARLAQMEAEI NNFPQQYETI VGERGITLSG
GQRQRTALAR AMLIDAPVLI LDDALSSVDN QTATQILRNL SSGTQQKTVI FITHQLSAAA
TADRIMVMDK GEIVQIGKHT ELVEQPGLYK KLWSQHQVQE LLK