Gene Aazo_4790 details

Gene Information

Locus tag: Aazo_4790
Symbol:
ID: 9342597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4892988
End bp: 4894841
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 41%
IMG OID:
Product: ABC transporter-like protein
Protein accession: YP_003723087
Protein GI: 298492910
COG category:
COG ID:
TIGRFAM ID:
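
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the usual GenBank convention, though this record does not state it) and that the CDS ends in a single stop codon. The function names are hypothetical and only serve the illustration.

```python
# Values copied from the record above.
START_BP = 4892988
END_BP = 4894841
GENE_LENGTH_BP = 1854
PROTEIN_LENGTH_AA = 617


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the stated fields.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, the stated gene length and protein length are consistent with the start and end coordinates.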


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.918505
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGAAA CTATCCTAGA GGTTCGCAAT CTACAAGTTG AGTTTTCCGG TGATGACAGC 
GCAGTTAAAG CTGTAGATGG GATTAGCTTT GAATTAAATC GAGGTGAGAC TCTAGGAATA
GTGGGAGAGT CTGGGAGTGG TAAATCAGTA ACCGCTTTAG CTGTGATGGG TTTGTTGCAA
TATCCCGGTA AAGCTAATGA AGGACAAATC TGGTTTCGTC CGCAGGAAAA TGCCGAACCG
CTGGATTTAC TGGCTTTACC TACCCAAGAA ATGCAGCTTT ACCGGGGTGG TGATATTGCT
ATGATTTTTC AAGAACCGAT GAGTTCTCTC AACCCGGTTT ATGATATTGG GTTTCAGCTG
ACAGAAGCAA TTATGCGTCA TCAAAATGTC AATGCAGTTG AAGCCAAGAG AATTGCGATC
GCAGGTCTAC AAGAAGTTAA ACTTTTACCT AGTGACCAAG AAATTCAAGA GCAATATCTT
GATAGTAGCC AGTTAACTGA CTCTAAATTA TCTCCAAGCA GTGATTATCA AATAGCTCAG
TTGGTGAAAG AACACAAAGA AGCGATGTTG AAACGCTATC CCCATCAACT TTCTGGGGGT
CAGTTACAAC GGGTGATGAT TGCAATGGCA ATTTCTTGTA ACCCATCACT GTTAATTGCT
GATGAACCGA CCACAGCTTT AGATGTGACA GTACAAGCAA CCATTATTGA GTTGATGCGG
GAATTGCAGC AAAAACGCAA CATGGGGATG ATTTTTATTA GTCATGACTT GAGTTTAATC
GCGCAAATTG CTGACCAAGT AGGGGTGATG TACAAAGGTA AAATTGTGGA ATATGGTGCA
GTATCGCAAA TTTTTAGTAA TCCCCAACAT CCCTATACTA GAGGTTTGGT AGCTTGTCGT
CCTACTCTAC ATTGTCGTCC GCACAAACTC CTCACAGTTT CTGATTACAT GAGTTTACAG
GAAGATGAAA GTGGACAGCT AGTAATTCGA GCCAAAGAAC CAGCAAAACC ACCGCAAATT
ACTCAGGAAG AACTTAACCA AAGATTGGCA AATCTGCAAG AGAAATCTCC CCTTTTACAA
ATTCATCATC TCAAAGTTGG GTTTCCTGTG CGGGGAGTGT TTGGCGGCAC AAAACGCTAC
AATATAGCAG TGAATTCTGT TTCTTTTGAT GTTTATCCAG GCGAAACTTT GGGATTGGTA
GGAGAATCTG GTTGTGGTAA AACCACTTTG GGTAGAAGTC TGCTCAGATT AATTGAACCC
ATGAGCGGTC AAATTACTTT TAAAGGACAA AATATCACTC ACCTTAAAGG AGAATCGTTG
CAAAAATTGC GGCGAGAAAT GCAAATTGTT TTTCAAAATC CTTTTAGTTC CCTTGACCCC
CGGATGAAAA TTGGTGATGC AGTTATGGAA CCATTGTTGA TTCATGGTGT GGGTAAATCA
AAACAACAGC GAAAAGAAAG AACTATACAA CTTTTAGAAC GGGTGGGATT GAGTGCGGAT
GATATGAAAC GCTATCCCCA TCAGTTTTCA GGTGGTCAAC GTCAACGGAT TTGTATTGCG
CGGTCGTTGG CTTTAAATCC TCAGTTTATT ATTTGTGATG AGTCGGTTTC GGCTTTGGAT
GTTTCGGTAC AAGCACAAGT TTTGAATTTG TTAAAAGAAT TGCAAAGGGA TTTTAATTTG
ACGTATATTT TCATTTCCCA TGATTTAAGT GTGGTCAAAT TTATGAGTGA TCGCATTTTG
GTAATGAATC AAGGGAAAAT AGTGGAAGAA GGGACATCAG AAAGCATTTA TCTTCAACCC
AAAGAAGAAT ATACGCAGAA ATTAATCGCG GCTATTCCGA CAGGGAATAA GTGA
 
Protein sequence
MKETILEVRN LQVEFSGDDS AVKAVDGISF ELNRGETLGI VGESGSGKSV TALAVMGLLQ 
YPGKANEGQI WFRPQENAEP LDLLALPTQE MQLYRGGDIA MIFQEPMSSL NPVYDIGFQL
TEAIMRHQNV NAVEAKRIAI AGLQEVKLLP SDQEIQEQYL DSSQLTDSKL SPSSDYQIAQ
LVKEHKEAML KRYPHQLSGG QLQRVMIAMA ISCNPSLLIA DEPTTALDVT VQATIIELMR
ELQQKRNMGM IFISHDLSLI AQIADQVGVM YKGKIVEYGA VSQIFSNPQH PYTRGLVACR
PTLHCRPHKL LTVSDYMSLQ EDESGQLVIR AKEPAKPPQI TQEELNQRLA NLQEKSPLLQ
IHHLKVGFPV RGVFGGTKRY NIAVNSVSFD VYPGETLGLV GESGCGKTTL GRSLLRLIEP
MSGQITFKGQ NITHLKGESL QKLRREMQIV FQNPFSSLDP RMKIGDAVME PLLIHGVGKS
KQQRKERTIQ LLERVGLSAD DMKRYPHQFS GGQRQRICIA RSLALNPQFI ICDESVSALD
VSVQAQVLNL LKELQRDFNL TYIFISHDLS VVKFMSDRIL VMNQGKIVEE GTSESIYLQP
KEEYTQKLIA AIPTGNK
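
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds the codon table from the amino-acid assignments that NCBI translation tables 1 and 11 share (the two tables differ in their permitted start codons, which this sketch ignores) and applies it to the first and last lines of the gene sequence above. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
FIRST_LINE = "ATGAAAGAAA CTATCCTAGA GGTTCGCAAT CTACAAGTTG AGTTTTCCGG TGATGACAGC"
LAST_LINE = "AAAGAAGAAT ATACGCAGAA ATTAATCGCG GCTATTCCGA CAGGGAATAA GTGA"

if __name__ == "__main__":
    print(translate(FIRST_LINE))
    print(translate(LAST_LINE))
```

Passing the complete gene sequence to `translate` and `gc_fraction` in the same way would allow the full protein sequence and the GC content field to be checked against the record.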