Gene Aazo_5073 details

Gene Information

Locus tag: Aazo_5073
Symbol:
ID: 9342882
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 5198202
End bp: 5201558
Gene Length: 3357 bp
Protein Length: 1118 aa
Translation table: 11
GC content: 41%
IMG OID:
Product: CBS sensor hybrid histidine kinase
Protein accession: YP_003723291
Protein GI: 298493114
COG category:
COG ID:
TIGRFAM ID:
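
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5198202
end_bp = 5201558
gene_length_bp = 3357
protein_length_aa = 1118

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: the span is 3357 bp, which is 1119 codons, or 1118 amino acids plus one stop codon.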


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.749087
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGCCA AAGGCGGACA GAGGAAGAAT GACCCCTTAA GTCTTATGTT AATGTTACAT 
TACCCGCTTT ATGACTTTCT ATCAACCGTA CCCAGCTGCC CGGAAACAAG CACTTTGGGA
GTGGTGCTGG AGATTTTGGA GAAAGAGCAG TGCGATCGCT TGGTAGTGGT GAATCAACAA
CAATACCCTA TAGGATTGTT GTACTCTGCC CGTTTAATCC CGAATTTATT AGCAGCAGCT
AGTGGCGAAA CGTTTTTAAC TCTACACCAG TCAATCTGTA ATTTAAGTCC CAGTCTGATT
TCACCAATAC AGACAATATC AGCTTCTGAG CATCTGGACA AATTTAATTT GTTTTTGCGT
TATCAACAAA ACCAAAAAAA TAAAATTTTA GATTGGGCGT TGACTGATTC AGATGGTAAA
TTTGTGGGTC TGGTGGATAA CTCACGCTTA CTAAGTTTTT TGACTAGGAA GAGATCAACA
ATAGGTCTGG GCCTCCCAGG CATGGGAACA GAAAATCAGG GTGTGGACAA AACTCGCTCA
TACGACCAAA ATGAGCATCA ACCATTCAAG CACAAGCCTT TGGTTAAGTT GCTGGAAAGA
CTACCTTGGC CTTTGATGTT ACAAACTGGT AATGGTGAGG TTTTAACGCA AAACCCTGCT
TGGTGGCAAC AATTAGGAGT CCTGAAAGAT CCAGAAGGAA TTAGACAACA AGTAGAAACT
ATTCTTGCTC CTGTCCGCCC TCAAGACCTA GAATACACTA ATCAACAAGC GGTGAAGATT
CATCCCCATA ACAGAGTCAG CGAAACCGAA GAAAGAGCCA CGGAGATGAA ATGGCCTGCT
GATAGTAATC ACTTTTCTAG AAGAAATGAA GCTCTTCACT CACCATCGGT TTTACCAATT
ATCGACGAGC TACAAGTACC TGCAAATACT ACTTCTGGTA GTCACTGCTA TTTAGATGCT
CAACAGGGTA CTTGTACTTG TGTTGTGGAA GTGCAGAATG GTCAGGAGCG AATTTGGCAG
TTTGCTAAAA TTCCTTTAGA TAGTCATGAA TTAAAAGTTT TGAGTGGTGA TTCAGAATTT
ACGCTCACCA CTGAAAATTC AGAACTCTTA ACTGAGGATC TCTGGTTGGT TTTAGCCACT
GATGTGACTG AACAGCAACA ACTTTTCAAA GAATTAGCAG CAAAAAATGC CGATCTGATT
CAACTGAATC GGTTAAAAGA TGAGTTTTTA GCTTGTATTA GTCATGAACT GAAAACACCA
CTGACAGCAG TTTTGGGTTT GTCACGTTTG TTGGTAGATC AACAATTAGG AGAATTAAAC
GAGCGACAAG CCCGTTATGC GAGTTTAATT CATCAAAGTG GCCGCCATTT GATGAGTGTG
ATTAATGATA TTTTGGATTT GACTCGGATG GAAACTGGAC AGATGGAACT TACTCCCTCC
CCAGTGCAAA TTCGGGAAGT GTGCGATCAC GCTTTATCAG AAGTTAAAAC ACTCCACAAT
CAACCTAGCA AATTTATTCA TTCCTCTCGA TCTGAAAGCA AGCATTCACA GGATCATCAA
TTCAGCTTAA GTATTGAACC GGGCTTAGAG CAGATAGTTG CAGATGAATT ACGATTGCGT
CAAATGTTGG TACACTTGCT TTCTAATGCC TTTAAGTTCA CCGAAACCTC TGGAGAAATT
GGGTTGCGGG TGAATCGTTG GGAAGGTTGG ATTGCTTTTA CAGTTTGGGA TACAGGTATT
GGTATTCCTG AACATCAACA ACATTTAATC TTTCAAAAAT TCCAACAATT GGAAAATCCC
CTGACTCGAC AGTTTGAAGG CACTGGTTTA GGACTAGTAT TAACAAGGGC TTTGGCTCGT
CTACATGGGG GAGATGTCAG CTTTTTGTCT CAGGAAGGTA AAGGTAGCCA GTTTACTCTG
CTATTACCAC CCACTCCTCC CAGAACCAGC TTTGGGGAAT CAGAAGGGGG AAATTCGGAA
TCACATCAAC CCATTTCTTC TCAACGATTA GTGTTAGTGG TTGAAGCAGT AGCTCGATAT
ATTGAAGACT TAACCCAACA TCTTAAAAGT TTAGGCTATC GCGTGGTGAT TGCGCGATCG
GGAACAGAAG CACTGGAAAA AGCCCGTCGT TTGCAACCAA AAACTATATT TTTGAACCCC
TTACTACCCC TACTTTCCGG TTGGGATGTG CTGACTTTAC TAAAATCGGA TAGTGTAACT
CGCCATATTC CTGTGATTGT CACAGCAACG GGAGCAGAAA AAGAACACGC TTATGCTCAC
CGTGCTGATA GTTTCTTGAG TTTACCAGTT GAACATCAGG CTTTAGTACC GATTTTGGAA
AGATTGTCTA ATACACCAGA AGGTGAACAA ATAGGTTTAG AAAATAACAA CAACACCCCA
ATAAAAACGC CTTTGCGGAT TCTGAGGTTG GTGAATTCCG AATTAGAATT TGTTAATCCC
CAACCTTCTT TGCGAGAACA TCGAGTCATT GAAGTAGATG ATTTAGATCA AGCAGAACTT
TTAGCAAGGG TTTGGCAGTT TGATGTCATT TTATTAGATG TGGAAGCTTC TACAGCCCAA
AATTATCTCC AAAAGTTGAC TAAGTATCCA CGTTTAGCGG CTATACCTTT GGTGACTTGC
GATGTTGCTA CTACTTTAAG TGCTTCCCAA ATACCAGAAT TATCTGTGTT TCCTTACTTA
ACTCCTTTGG GCAAAGAAAA TGCTAATTCC AAAGGAAAAC CAGATGCTTT ATTGTCAGTG
CTGCAAATTG CTTCTGGTAT TTGCTGCCCT CCCAATATTC TAGTAGTAGA TGTGACCATG
TTACGTGATT TGCCACAGGT AAAATCCAAG CAGGTTAAGG GTGAACAACT AGAAAAGAAA
TCTGCTCTCA AAAGTGAGTT TGCACAGGGT GAATCTTCTG CAAATACAGA ACGGGGATCA
GAATGGTTTC AAGCTTTAAT TCAGTACTTA CAAACTGCTG GCTTTAAAGC GGCTATGGGA
AACTGCTGGG CAGAGATGCT GCAACAAATT CGCCATCAAA GTGTTGATTT AATCCTAATT
TGTTTGGGTG AATCTGCTAT TCATAAAGAA GTGCAGAAAG CATTAAAAGC TCTGCCAGAT
TTACAGTTGA ACTTACCACC TATTTTGGTG ATTGCTCAAA GATTGAATCA GGTGAAAAAT
GCAGAAGTTT CACAACCAAA CAAGGAAGGA TATCAGCTAT TAAAAAATAA TAATGTGGTT
GGTGGTTTAG CAGATATTGC TGGTAATATT GCTACCCAGA TATTACCTCG CTCTATCTCC
ATGGAAGATT TATTAAAACA AATTAATCAG GCTTTGCTAA ATGGTAAAAA TTGTTGA
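
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before any computation such as GC content. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical. For brevity it runs on the first line of the sequence only, so its result is not expected to equal the whole-gene GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed (illustration only).
first_line = "ATGGATGCCA AAGGCGGACA GAGGAAGAAT GACCCCTTAA GTCTTATGTT AATGTTACAT"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Passing the full sequence text to `clean_sequence` and then to `gc_fraction` would give the whole-gene value.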
 
Protein sequence
MDAKGGQRKN DPLSLMLMLH YPLYDFLSTV PSCPETSTLG VVLEILEKEQ CDRLVVVNQQ 
QYPIGLLYSA RLIPNLLAAA SGETFLTLHQ SICNLSPSLI SPIQTISASE HLDKFNLFLR
YQQNQKNKIL DWALTDSDGK FVGLVDNSRL LSFLTRKRST IGLGLPGMGT ENQGVDKTRS
YDQNEHQPFK HKPLVKLLER LPWPLMLQTG NGEVLTQNPA WWQQLGVLKD PEGIRQQVET
ILAPVRPQDL EYTNQQAVKI HPHNRVSETE ERATEMKWPA DSNHFSRRNE ALHSPSVLPI
IDELQVPANT TSGSHCYLDA QQGTCTCVVE VQNGQERIWQ FAKIPLDSHE LKVLSGDSEF
TLTTENSELL TEDLWLVLAT DVTEQQQLFK ELAAKNADLI QLNRLKDEFL ACISHELKTP
LTAVLGLSRL LVDQQLGELN ERQARYASLI HQSGRHLMSV INDILDLTRM ETGQMELTPS
PVQIREVCDH ALSEVKTLHN QPSKFIHSSR SESKHSQDHQ FSLSIEPGLE QIVADELRLR
QMLVHLLSNA FKFTETSGEI GLRVNRWEGW IAFTVWDTGI GIPEHQQHLI FQKFQQLENP
LTRQFEGTGL GLVLTRALAR LHGGDVSFLS QEGKGSQFTL LLPPTPPRTS FGESEGGNSE
SHQPISSQRL VLVVEAVARY IEDLTQHLKS LGYRVVIARS GTEALEKARR LQPKTIFLNP
LLPLLSGWDV LTLLKSDSVT RHIPVIVTAT GAEKEHAYAH RADSFLSLPV EHQALVPILE
RLSNTPEGEQ IGLENNNNTP IKTPLRILRL VNSELEFVNP QPSLREHRVI EVDDLDQAEL
LARVWQFDVI LLDVEASTAQ NYLQKLTKYP RLAAIPLVTC DVATTLSASQ IPELSVFPYL
TPLGKENANS KGKPDALLSV LQIASGICCP PNILVVDVTM LRDLPQVKSK QVKGEQLEKK
SALKSEFAQG ESSANTERGS EWFQALIQYL QTAGFKAAMG NCWAEMLQQI RHQSVDLILI
CLGESAIHKE VQKALKALPD LQLNLPPILV IAQRLNQVKN AEVSQPNKEG YQLLKNNNVV
GGLADIAGNI ATQILPRSIS MEDLLKQINQ ALLNGKNC
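
The protein sequence can be related back to the gene sequence by codon translation. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments, which NCBI translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Standard codon assignments, listed with bases in T, C, A, G order
# (third base varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied as displayed (illustration only).
first_line = "ATGGATGCCA AAGGCGGACA GAGGAAGAAT GACCCCTTAA GTCTTATGTT AATGTTACAT"
print(translate(first_line))
```

The 60 bases on that line translate to the first 20 residues of the protein sequence shown above (MDAKGGQRKN DPLSLMLMLH).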