Gene Aazo_3972 details

Gene Information

Locus tag: Aazo_3972
Symbol:
ID: 9341776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4035371
End bp: 4037500
Gene Length: 2130 bp
Protein Length: 709 aa
Translation table: 11
GC content: 43%
IMG OID:
Product: K+-transporting ATPase subunit B
Protein accession: YP_003722586
Protein GI: 298492409
COG category:
COG ID:
TIGRFAM ID:
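
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The Python sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted as a residue. The variable names are illustrative and not part of the record.

```python
# Values copied from the gene information in this record.
start_bp = 4035371
end_bp = 4037500

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted as an amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 2130 709
```

Both results agree with the Gene Length and Protein Length fields.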


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.315323
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTCCG CTACAACTAC TCCTAGACCA AAGTCACCCC GTTCTCGTCC TAGTGATCGC 
CGTCAAAAAC GCAAAAAAGT CCGAGTCAGT ACCAAACTAC TCTATACCAG AGTAATTAGA
GATACCTTTA TCAAGCTATA CCCCAAATAT GCTATTCAAA ATCCCGTCAT GTTTTTGGTT
TGGTTAGCAA CAACTACCAC CCTAGCCGCT ACAATTTACC CACCATTATT TGGGCCAGTC
ACCCAGAAAA ACCCCCAACT ATTCAATGGT TTATTAACAC TAATTTTGTT TTCGACAGTT
ATTTTTGCCA ATTTTGCCGA AGCCTTAGCA GAAGGAAGAG GTAAAGCCCA AGCAGATGCA
TTACGATCTA CCAGATCAGA AACCATCGCC AAAAAAATTG TTGTTGATGG CACAATCACA
GAAATTTCTT CCACTACCCT CAAACAAGGT GATACTGTTT ACGTTGTTGC TGGTGATATG
ATCCCCGCCG ATGGCGAAGT AATTCTGGGT ATCGCCTCTG TAGATGAATC AGCAATTACC
GGCGAATCAG CCCCAGTCCT CAAAGAATCA GGTTCAGACG TTTCCAGTTC CGTTACAGGT
GGTACACGTA TCATCTCCGA TGAATTAATT ATTCGTATTA CCGCTGACCC TGGTAAAGGA
TTCATTGACC GGATGATTGA TTTGGTCGAA GGGGCAGAAT GTACTAAAAC ACCTAATGAA
ATTGCTTTAA CAGTATTACT AGCAGTTCTT ACCACCGCCT TTTTCATTGT CGTCGCCACC
TTGCCAGTCT TTGCCTACTA TGTTAAAAGT CCTGTCAGCA TCCCTGTACT AATTGCCTTG
TTAGTTGCCC TCATCCCCAC CACCATCGGC GGCTTACTCA GTGCCATTGG TATCGCAGGT
ATGGATAGAG TGGCCCAATT TAACCTGATT GCCACCTCTG GCAGGGTGGT GGAAGCCTGT
GGGGATATCA ATACTTTGGT TTTAGATAAA ACAGGTACAA TTACCCTGGG CAACCGTTTA
GCAGAAGCCT TTCTTCCCAT CAATGCTCAC TCAATGGCAG AAATTGCTAA CGTTGCTTTA
GGCGCTAGTA TATTTGACGA TACTCCAGAG GGTAAATCTA TTGTCCGACT AGCAGAAAAA
TGTGGCGCAA AATTTGATAT CGATCGCAAA AAATGCCAAG GTGTGGAATT TTCAGCGACA
ACTCGCATGA GTGGCACTAA TTTGCCAGGT GGACACGAAG TACGGAAAGG CGCAGTCGGA
GCAATTAAAG GTTTTGTGCG ATCGCGTAGG GGCTCTTCTG GAGCATCGCA CAATGGAAAT
GAAACCCCAG AACTAGTCAC TGCTTATGAA GCAGTTTCCC ACCAAGGCGG TACACCCCTA
TTGGTTTGCC TAGATAATGA AATCTATGGT GTCATTTATC TAAAAGATAT TGTTAAACCT
GGAATCCATG AACGTTTTGA TCAACTACGA CGCATGGGAG TCCGAACAGT GATGCTAACT
GGAGACAATC GCATCACGGC TTCTGTCATT GCTAAAGAAG CTGGAGTAGA TGAATTCATA
GCCAAAGCCA CACCAGAAGA TAAAGTCAGC ATTATTCAAC GGGAACAAGC AGAAGGCAAA
GTGGTGGCGA TGACCGGAGA CGGCACTAAT GATGCACTCG CTTTAGCGCA GGCTGATGTA
GGAATAGCCA TGAATACAGG CACTAAATCC GCTAAAGAAG CTGCTAATAT GGTGGATTTG
GATTCTGATC CCACAAAACT AGTTGATATT GTCAGCATTG GTAAACAATT GCTCATTACT
CGTGGAGCAT TGACCATATT CTCCATCGCC AATGATATAG CTAAGTATTT TGCCATTATC
CCGGTGATTT TTGCTGCTGC TAATCTCAAC AGCTTAAATA TTATGAAATT AACCAGTGTT
AACTCTGCGA TACTGTCAGC ACTGATTTAT AATGCTTTAA TTATTCCAGC TTTAATCCCT
TTAGCTTTGA AGGGTGTAAA ATTTCGACCC TGGACTGCTA ATCAACTTCT GCAACGCAAT
ATTCTTATTT ATGGCTTGGG TGGGGTGATT GTTCCATTTA TTGCCATTAA ATTCATAGAT
ATGTTGATTA CATTTGCAGG TTTAGCTTAA
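
The gene sequence is displayed in space-separated blocks of ten bases, so the layout whitespace has to be removed before any computation. As a minimal sketch, the hypothetical helpers below (`clean_sequence` and `gc_fraction`, which are not part of the source database) strip the layout and compute the G+C fraction, the quantity reported as GC content in the gene information. Only the first displayed line is used here; the whole-gene figure of 43% comes from the record and is not recomputed.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied verbatim.
first_line = "ATGAATTCCG CTACAACTAC TCCTAGACCA AAGTCACCCC GTTCTCGTCC TAGTGATCGC"
seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))
```

Passing the full 2130 bp sequence through the same two functions would give the whole-gene value to compare against the GC content field.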
 
Protein sequence
MNSATTTPRP KSPRSRPSDR RQKRKKVRVS TKLLYTRVIR DTFIKLYPKY AIQNPVMFLV 
WLATTTTLAA TIYPPLFGPV TQKNPQLFNG LLTLILFSTV IFANFAEALA EGRGKAQADA
LRSTRSETIA KKIVVDGTIT EISSTTLKQG DTVYVVAGDM IPADGEVILG IASVDESAIT
GESAPVLKES GSDVSSSVTG GTRIISDELI IRITADPGKG FIDRMIDLVE GAECTKTPNE
IALTVLLAVL TTAFFIVVAT LPVFAYYVKS PVSIPVLIAL LVALIPTTIG GLLSAIGIAG
MDRVAQFNLI ATSGRVVEAC GDINTLVLDK TGTITLGNRL AEAFLPINAH SMAEIANVAL
GASIFDDTPE GKSIVRLAEK CGAKFDIDRK KCQGVEFSAT TRMSGTNLPG GHEVRKGAVG
AIKGFVRSRR GSSGASHNGN ETPELVTAYE AVSHQGGTPL LVCLDNEIYG VIYLKDIVKP
GIHERFDQLR RMGVRTVMLT GDNRITASVI AKEAGVDEFI AKATPEDKVS IIQREQAEGK
VVAMTGDGTN DALALAQADV GIAMNTGTKS AKEAANMVDL DSDPTKLVDI VSIGKQLLIT
RGALTIFSIA NDIAKYFAII PVIFAAANLN SLNIMKLTSV NSAILSALIY NALIIPALIP
LALKGVKFRP WTANQLLQRN ILIYGLGGVI VPFIAIKFID MLITFAGLA
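
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own pipeline: `translate` is a hypothetical helper, and it assumes the sequence is given in the coding orientation and in frame. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as starts), so the standard 64-codon table is used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the block layout whitespace
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last displayed lines of the gene sequence, copied verbatim.
first_line = "ATGAATTCCG CTACAACTAC TCCTAGACCA AAGTCACCCC GTTCTCGTCC TAGTGATCGC"
last_line = "ATGTTGATTA CATTTGCAGG TTTAGCTTAA"

print(translate(first_line))  # MNSATTTPRPKSPRSRPSDR
print(translate(last_line))   # MLITFAGLA
```

The two outputs match the first twenty and the last nine residues of the protein sequence. The last line can be translated on its own because the 2100 bases before it are a multiple of three, so the reading frame is preserved.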