Gene Information |
Locus tag | Aazo_5073 |
Symbol | |
ID | 9342882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 5198202 |
End bp | 5201558 |
Gene Length | 3357 bp |
Protein Length | 1118 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | |
Product | CBS sensor hybrid histidine kinase |
Protein accession | YP_003723291 |
Protein GI | 298493114 |
COG category | |
COG ID | |
TIGRFAM ID | |
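
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. The function names (span_length, expected_protein_length, gc_percent) are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 5198202
END_BP = 5201558
GENE_LENGTH_BP = 3357
PROTEIN_LENGTH_AA = 1118


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace used to group bases."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the values in this record, 5201558 - 5198202 + 1 gives 3357 bp, and 3357 / 3 - 1 gives 1118 residues, matching the Gene Length and Protein Length fields. The gc_percent helper can be applied to the Gene sequence field to compare against the listed GC content; no result for that is asserted here.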
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.749087 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGCCA AAGGCGGACA GAGGAAGAAT GACCCCTTAA GTCTTATGTT AATGTTACAT TACCCGCTTT ATGACTTTCT ATCAACCGTA CCCAGCTGCC CGGAAACAAG CACTTTGGGA GTGGTGCTGG AGATTTTGGA GAAAGAGCAG TGCGATCGCT TGGTAGTGGT GAATCAACAA CAATACCCTA TAGGATTGTT GTACTCTGCC CGTTTAATCC CGAATTTATT AGCAGCAGCT AGTGGCGAAA CGTTTTTAAC TCTACACCAG TCAATCTGTA ATTTAAGTCC CAGTCTGATT TCACCAATAC AGACAATATC AGCTTCTGAG CATCTGGACA AATTTAATTT GTTTTTGCGT TATCAACAAA ACCAAAAAAA TAAAATTTTA GATTGGGCGT TGACTGATTC AGATGGTAAA TTTGTGGGTC TGGTGGATAA CTCACGCTTA CTAAGTTTTT TGACTAGGAA GAGATCAACA ATAGGTCTGG GCCTCCCAGG CATGGGAACA GAAAATCAGG GTGTGGACAA AACTCGCTCA TACGACCAAA ATGAGCATCA ACCATTCAAG CACAAGCCTT TGGTTAAGTT GCTGGAAAGA CTACCTTGGC CTTTGATGTT ACAAACTGGT AATGGTGAGG TTTTAACGCA AAACCCTGCT TGGTGGCAAC AATTAGGAGT CCTGAAAGAT CCAGAAGGAA TTAGACAACA AGTAGAAACT ATTCTTGCTC CTGTCCGCCC TCAAGACCTA GAATACACTA ATCAACAAGC GGTGAAGATT CATCCCCATA ACAGAGTCAG CGAAACCGAA GAAAGAGCCA CGGAGATGAA ATGGCCTGCT GATAGTAATC ACTTTTCTAG AAGAAATGAA GCTCTTCACT CACCATCGGT TTTACCAATT ATCGACGAGC TACAAGTACC TGCAAATACT ACTTCTGGTA GTCACTGCTA TTTAGATGCT CAACAGGGTA CTTGTACTTG TGTTGTGGAA GTGCAGAATG GTCAGGAGCG AATTTGGCAG TTTGCTAAAA TTCCTTTAGA TAGTCATGAA TTAAAAGTTT TGAGTGGTGA TTCAGAATTT ACGCTCACCA CTGAAAATTC AGAACTCTTA ACTGAGGATC TCTGGTTGGT TTTAGCCACT GATGTGACTG AACAGCAACA ACTTTTCAAA GAATTAGCAG CAAAAAATGC CGATCTGATT CAACTGAATC GGTTAAAAGA TGAGTTTTTA GCTTGTATTA GTCATGAACT GAAAACACCA CTGACAGCAG TTTTGGGTTT GTCACGTTTG TTGGTAGATC AACAATTAGG AGAATTAAAC GAGCGACAAG CCCGTTATGC GAGTTTAATT CATCAAAGTG GCCGCCATTT GATGAGTGTG ATTAATGATA TTTTGGATTT GACTCGGATG GAAACTGGAC AGATGGAACT TACTCCCTCC CCAGTGCAAA TTCGGGAAGT GTGCGATCAC GCTTTATCAG AAGTTAAAAC ACTCCACAAT CAACCTAGCA AATTTATTCA TTCCTCTCGA TCTGAAAGCA AGCATTCACA GGATCATCAA TTCAGCTTAA GTATTGAACC GGGCTTAGAG CAGATAGTTG CAGATGAATT ACGATTGCGT CAAATGTTGG TACACTTGCT TTCTAATGCC TTTAAGTTCA CCGAAACCTC TGGAGAAATT GGGTTGCGGG TGAATCGTTG GGAAGGTTGG ATTGCTTTTA CAGTTTGGGA TACAGGTATT GGTATTCCTG AACATCAACA ACATTTAATC TTTCAAAAAT TCCAACAATT GGAAAATCCC 
CTGACTCGAC AGTTTGAAGG CACTGGTTTA GGACTAGTAT TAACAAGGGC TTTGGCTCGT CTACATGGGG GAGATGTCAG CTTTTTGTCT CAGGAAGGTA AAGGTAGCCA GTTTACTCTG CTATTACCAC CCACTCCTCC CAGAACCAGC TTTGGGGAAT CAGAAGGGGG AAATTCGGAA TCACATCAAC CCATTTCTTC TCAACGATTA GTGTTAGTGG TTGAAGCAGT AGCTCGATAT ATTGAAGACT TAACCCAACA TCTTAAAAGT TTAGGCTATC GCGTGGTGAT TGCGCGATCG GGAACAGAAG CACTGGAAAA AGCCCGTCGT TTGCAACCAA AAACTATATT TTTGAACCCC TTACTACCCC TACTTTCCGG TTGGGATGTG CTGACTTTAC TAAAATCGGA TAGTGTAACT CGCCATATTC CTGTGATTGT CACAGCAACG GGAGCAGAAA AAGAACACGC TTATGCTCAC CGTGCTGATA GTTTCTTGAG TTTACCAGTT GAACATCAGG CTTTAGTACC GATTTTGGAA AGATTGTCTA ATACACCAGA AGGTGAACAA ATAGGTTTAG AAAATAACAA CAACACCCCA ATAAAAACGC CTTTGCGGAT TCTGAGGTTG GTGAATTCCG AATTAGAATT TGTTAATCCC CAACCTTCTT TGCGAGAACA TCGAGTCATT GAAGTAGATG ATTTAGATCA AGCAGAACTT TTAGCAAGGG TTTGGCAGTT TGATGTCATT TTATTAGATG TGGAAGCTTC TACAGCCCAA AATTATCTCC AAAAGTTGAC TAAGTATCCA CGTTTAGCGG CTATACCTTT GGTGACTTGC GATGTTGCTA CTACTTTAAG TGCTTCCCAA ATACCAGAAT TATCTGTGTT TCCTTACTTA ACTCCTTTGG GCAAAGAAAA TGCTAATTCC AAAGGAAAAC CAGATGCTTT ATTGTCAGTG CTGCAAATTG CTTCTGGTAT TTGCTGCCCT CCCAATATTC TAGTAGTAGA TGTGACCATG TTACGTGATT TGCCACAGGT AAAATCCAAG CAGGTTAAGG GTGAACAACT AGAAAAGAAA TCTGCTCTCA AAAGTGAGTT TGCACAGGGT GAATCTTCTG CAAATACAGA ACGGGGATCA GAATGGTTTC AAGCTTTAAT TCAGTACTTA CAAACTGCTG GCTTTAAAGC GGCTATGGGA AACTGCTGGG CAGAGATGCT GCAACAAATT CGCCATCAAA GTGTTGATTT AATCCTAATT TGTTTGGGTG AATCTGCTAT TCATAAAGAA GTGCAGAAAG CATTAAAAGC TCTGCCAGAT TTACAGTTGA ACTTACCACC TATTTTGGTG ATTGCTCAAA GATTGAATCA GGTGAAAAAT GCAGAAGTTT CACAACCAAA CAAGGAAGGA TATCAGCTAT TAAAAAATAA TAATGTGGTT GGTGGTTTAG CAGATATTGC TGGTAATATT GCTACCCAGA TATTACCTCG CTCTATCTCC ATGGAAGATT TATTAAAACA AATTAATCAG GCTTTGCTAA ATGGTAAAAA TTGTTGA
|
Protein sequence | MDAKGGQRKN DPLSLMLMLH YPLYDFLSTV PSCPETSTLG VVLEILEKEQ CDRLVVVNQQ QYPIGLLYSA RLIPNLLAAA SGETFLTLHQ SICNLSPSLI SPIQTISASE HLDKFNLFLR YQQNQKNKIL DWALTDSDGK FVGLVDNSRL LSFLTRKRST IGLGLPGMGT ENQGVDKTRS YDQNEHQPFK HKPLVKLLER LPWPLMLQTG NGEVLTQNPA WWQQLGVLKD PEGIRQQVET ILAPVRPQDL EYTNQQAVKI HPHNRVSETE ERATEMKWPA DSNHFSRRNE ALHSPSVLPI IDELQVPANT TSGSHCYLDA QQGTCTCVVE VQNGQERIWQ FAKIPLDSHE LKVLSGDSEF TLTTENSELL TEDLWLVLAT DVTEQQQLFK ELAAKNADLI QLNRLKDEFL ACISHELKTP LTAVLGLSRL LVDQQLGELN ERQARYASLI HQSGRHLMSV INDILDLTRM ETGQMELTPS PVQIREVCDH ALSEVKTLHN QPSKFIHSSR SESKHSQDHQ FSLSIEPGLE QIVADELRLR QMLVHLLSNA FKFTETSGEI GLRVNRWEGW IAFTVWDTGI GIPEHQQHLI FQKFQQLENP LTRQFEGTGL GLVLTRALAR LHGGDVSFLS QEGKGSQFTL LLPPTPPRTS FGESEGGNSE SHQPISSQRL VLVVEAVARY IEDLTQHLKS LGYRVVIARS GTEALEKARR LQPKTIFLNP LLPLLSGWDV LTLLKSDSVT RHIPVIVTAT GAEKEHAYAH RADSFLSLPV EHQALVPILE RLSNTPEGEQ IGLENNNNTP IKTPLRILRL VNSELEFVNP QPSLREHRVI EVDDLDQAEL LARVWQFDVI LLDVEASTAQ NYLQKLTKYP RLAAIPLVTC DVATTLSASQ IPELSVFPYL TPLGKENANS KGKPDALLSV LQIASGICCP PNILVVDVTM LRDLPQVKSK QVKGEQLEKK SALKSEFAQG ESSANTERGS EWFQALIQYL QTAGFKAAMG NCWAEMLQQI RHQSVDLILI CLGESAIHKE VQKALKALPD LQLNLPPILV IAQRLNQVKN AEVSQPNKEG YQLLKNNNVV GGLADIAGNI ATQILPRSIS MEDLLKQINQ ALLNGKNC