Gene Aazo_5029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5029 
Symbol 
ID9342837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5148150 
End bp5149763 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content32% 
IMG OID 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_003723261 
Protein GI298493084 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.559504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATTTAT TTTATAAGCT TAAATTTACT CCTTCTCAGT TAGTAAGTCT GACAGTATTG 
CTAACATTAC TATTATTTAT TCCTCAAGTT TGGCTCAATT GGCAAGCATA TTATAACTTC
AACAATATTG CTAAACAGGA ATTTCACCTC CAAAAATTAA GTAATGAAAT TACTTATTTG
GATGAAGTAT TAACCATGTC AGCCCGGATG AATGCGGCTA CAGGTAATAT TCTTTGGGAA
AAAAGATATC GCCAATTTGA ACCTAAACTC GATCTGGCAA TTAAAGAAGC TATTAAAGTA
GGTCCTGAAA CATATAAAGA TCAGAATCCC CAAAAAATTG ATATTGCTAA TCAGCAGTTA
ATTGCCATGG AATATAAATC TTTTGATTTA GTTAATAAAA ATCAAAAACA AGCTGCACAA
AAAATATTGT CCAGCCGTAA ATACGAAACT CAGAAACATA TTTATACCGA TGGTATTGCT
AAAAGGAACA AGAATATATC ACTTGCATTG GAACAAAAAG TTGCTGAATA TCGCCAAGGA
ATGATTTGGG CTATTTTAGG TTCTATTTTA AGTTTAACAA TACTTATCCC AATATGGATT
TTAGTATTAC GTCTATTGCA AGAATATTTA AAAGCTAAAA AAAATGCTCA AGCTGCCCTA
GAAGAAACTA ATTATATGTT AGAAATGCAA GTTGCAACGC GAACGGCAAG CTTAAACCAG
AAAAATCTTC AATTACAAAA GACACTACAA GAACTTCAGC AGACTCAAGT ACAACTGATT
CAAACTGAAA AGATGTCTTC ACTAGGTCAG TTAGTTGCTG GTGTTGCTCA TGAGATTAAT
AATCCTGTTA ATTTCATTTA TGGCAACTTA ATTCACGTTA GGGAATACAC TCAGAATTTA
TTAACTTTGA TTAACCGATA CCGACAAGAA AATTCTAATT ACAATCCAGA AATAAATATT
TTGATTGAGG AGATAGAATT AGATTTTCTG ATTGATGATC TCCCTAAAAT ATTATCCTCT
ATGGCAGTTG GTGCTGAACG TATCCGTGAG ATTGTCTTAA GTTTACGGAA TTTCTCGCGT
CTTGATGAAG CAGAAATGAA GCCTGTTAAT ATTCATGAAG GGCTTGATAG TACACTGTTA
ATTTTGCAAG ATATTATTAA AGGTCAGGAG GAACAGCAGG AAATTTTAAT TATTAAAGAC
TATGGAAATT TACCTCTTGT TGAATGTTAT GCAGGAGGAT TAAATCAGGT ATTTATGAAT
ATAATTGTGA ACGCTATTGA TGCTTTGCGT CAACAGGAAA TAGATTCTTC TAAAGATATT
AACAAACACT TCAGTTCAAT TATTATTCAT ACTCAAGTCA GAAATGACGA GAAGGTAATT
ATCAACATTC AAGATAATGG CATAGGAATA GGAGAAACAG TTAAAAATAA ATTGTTTGAA
CCATTTTTTA CTACTAAACC TGTAGGTAAA GGTACTGGAT TAGGATTATC TATTAGTTAC
CAGATTATAG TAGATAAGCA TAAAGGAAAG ATCGAGTGTA TTTCTGAACC TGAAAAAGGA
ACAGAATTTG TGATTGAGAT TCCTATTAGA CAGATAAAGC CAGCAAATAC ATAG
 
Protein sequence
MHLFYKLKFT PSQLVSLTVL LTLLLFIPQV WLNWQAYYNF NNIAKQEFHL QKLSNEITYL 
DEVLTMSARM NAATGNILWE KRYRQFEPKL DLAIKEAIKV GPETYKDQNP QKIDIANQQL
IAMEYKSFDL VNKNQKQAAQ KILSSRKYET QKHIYTDGIA KRNKNISLAL EQKVAEYRQG
MIWAILGSIL SLTILIPIWI LVLRLLQEYL KAKKNAQAAL EETNYMLEMQ VATRTASLNQ
KNLQLQKTLQ ELQQTQVQLI QTEKMSSLGQ LVAGVAHEIN NPVNFIYGNL IHVREYTQNL
LTLINRYRQE NSNYNPEINI LIEEIELDFL IDDLPKILSS MAVGAERIRE IVLSLRNFSR
LDEAEMKPVN IHEGLDSTLL ILQDIIKGQE EQQEILIIKD YGNLPLVECY AGGLNQVFMN
IIVNAIDALR QQEIDSSKDI NKHFSSIIIH TQVRNDEKVI INIQDNGIGI GETVKNKLFE
PFFTTKPVGK GTGLGLSISY QIIVDKHKGK IECISEPEKG TEFVIEIPIR QIKPANT