Gene Aazo_1199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1199 
Symbol 
ID9338994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1278885 
End bp1280843 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content40% 
IMG OID 
ProductATP-binding region ATPase domain-containing protein 
Protein accessionYP_003720638 
Protein GI298490461 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAGAAC AAGGCACTAT CAGTATACAT ACTGATAATA TTTTCCCAAT TATCAAAAAG 
TCTCTCTATT CAGATCACCA AATTTTCTTG CGGGAACTGG TATCCAACGC TGTAGATGCC
ATCCAAAAGC TAAAAATGGT GTCCCGCGCT GGTGAGTATA ATGGAGACAC GGGTGAACCA
GAAATTACAA TTGGCATTGA TCAAGATAAA AAGACCCTCT CCATCTCCGA TAATGGGATT
GGGATGACAG CAGAGGAAGT CAAAAAATAT ATTAACCAAG TCGCTTTCTC AAGTGCAGAA
GAATTTATTC ACAAATATGA AGGGAAAGCA GATCAACCAA TAATCGGACA CTTTGGTTTA
GGTTTCTACT CCTCCTTCAT GGTGGCGAAA AAAGTAGACA TTGATACTCT TTCCTATCAA
GAAGGTTCTC AAGCAGTTCA CTGGACTTGT GATGGTTCAC CAGAGTTTAC CTTAGATGAG
TCTTCTCGCA CTACTCGCGG TACTACTATT ACCCTCACTT TAATGCCAGA TGAGGAAGAA
TATTTAGAAG CTGCGAGAAT TAGAACTCTA GTGAAAACTT ACTGTGATTT TATGCCCGTA
CCCATCAAAT TAGATGGGCA AGTATTGAAC CAAGAAAAAG CACCTTGGAG GGAATCTCCT
AGCAATTTAA CCAAAGAAGA TTATTTAGAA TTTTACCGCT ATCTATATCC TTTTCAAGAA
GAACCTTTGT TATGGGTGCA TCTGAATACA GATTATCCGT TTATTATTAA CGGGATTATG
TATTTTCCCA AGATGCGGCC TGATGTGGAT GTGACTAAAG GACAAATTAA GTTATTCTGC
AATCAGGTTT TTGTTAGCGA TAACTGTGAA GAAATTATCC CCCAATTTTT AATGCCCATG
CGGGGTGTGA TTGATAGTAC GGATATTCCA TTGAACGTTT CTCGCAGTGC TTTACAGGGG
GATCGCACTG TTAAACGTAT TGGTGACTAC ATAGCAAAGA AAGTAGGTGA TCGCCTCAAA
GAATTATACC GCGACGACCG CGAACAATAC ATCAGTGCTT GGAAAGACTT AGGAACATTC
GTTAAATTTG GCGTTCTCAA CGACGATAAA TTTAAAAAAC AAGTCGAAGA CATCCTCATC
TTCCGCAGCA CTGCTAAAAT AGAAACAACA CCCGCAGTTG AAGTCCAATC ATCAGAAGGT
GATCTCTGGC AAGATGTCAC CCCATCTAAC ACCAGCAGCA CACCTTACAC CACCATCAAA
GAATATCTAG AACGCAACAA AGAACGCCAC GAAAACCGCG TTTTTTACAG CACCGATGAA
GCCAGTCAAT CCACATATAT TGAACTGCAC AAAAACCAAG GTTTAGAAGT CCTATTTATG
GACTCCTTCA TCGACACCCA CTTTATTAAC TTCCTAGAAA GAGAATATCA GGATGTTAAA
TTTACACGGG TAGATTCTGA CCTTGATAAT ACCTTATTGG ATGATAAATC CGGCGAAATT
GTTGACCCCA CCACCAACAA AACCAAAAGT GAAATTATCA AAGAACTATT TGAGAAATCA
CTCAACAGAC CCAAAGTTAA CATCCGCACG GAAGCCTTAA AATCAGATGA CCCTCAAGGA
ACACCACCAG CAATGGTATT GTTACCAGAA ATTCTCCGTC GTCTAAGGGA AATGAACGCC
ATGATGCAAC AGCAAAACGC CGATTTTCCT GAAGATCATA TTTTGTTAGT AAATACCGCA
CATCCCTTGA TTCAAAACCT CGCTAATATC AATCAAGGTA GTATCATCAT TCAAAGTGAT
GGAGAATCAC CCACAGAACA GTTAGTGAAA ATGATTTGTC AGCACGTTTA TGATTTAGCA
TTAATGTCCC AAAAAGGCTT TGATGCTGAA GGTATGAAAT CCTTTGTGGA ACGTTCTAAT
GATGTGTTGA CGAAGCTAAC AGAACAAGCG AGTAAGTAG
 
Protein sequence
MLEQGTISIH TDNIFPIIKK SLYSDHQIFL RELVSNAVDA IQKLKMVSRA GEYNGDTGEP 
EITIGIDQDK KTLSISDNGI GMTAEEVKKY INQVAFSSAE EFIHKYEGKA DQPIIGHFGL
GFYSSFMVAK KVDIDTLSYQ EGSQAVHWTC DGSPEFTLDE SSRTTRGTTI TLTLMPDEEE
YLEAARIRTL VKTYCDFMPV PIKLDGQVLN QEKAPWRESP SNLTKEDYLE FYRYLYPFQE
EPLLWVHLNT DYPFIINGIM YFPKMRPDVD VTKGQIKLFC NQVFVSDNCE EIIPQFLMPM
RGVIDSTDIP LNVSRSALQG DRTVKRIGDY IAKKVGDRLK ELYRDDREQY ISAWKDLGTF
VKFGVLNDDK FKKQVEDILI FRSTAKIETT PAVEVQSSEG DLWQDVTPSN TSSTPYTTIK
EYLERNKERH ENRVFYSTDE ASQSTYIELH KNQGLEVLFM DSFIDTHFIN FLEREYQDVK
FTRVDSDLDN TLLDDKSGEI VDPTTNKTKS EIIKELFEKS LNRPKVNIRT EALKSDDPQG
TPPAMVLLPE ILRRLREMNA MMQQQNADFP EDHILLVNTA HPLIQNLANI NQGSIIIQSD
GESPTEQLVK MICQHVYDLA LMSQKGFDAE GMKSFVERSN DVLTKLTEQA SK