Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3864 |
Symbol | |
ID | 5901326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4181055 |
End bp | 4184786 |
Gene Length | 3732 bp |
Protein Length | 1243 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641564386 |
Product | nitrate reductase, alpha subunit |
Protein accession | YP_001685488 |
Protein GI | 167647825 |
COG category | [C] Energy production and conversion |
COG ID | [COG5013] Nitrate reductase alpha subunit |
TIGRFAM ID | [TIGR01580] respiratory nitrate reductase, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCATA CCCTGGACCG CCTCGCGTTC TTCACGCGCA AGACCGAGCT GTTCTCCGAC GGCCACGGCG TCATGAACGA TGACGACCGC ACCTGGGAGC AGGCCTACCG AAACCGCTGG CAGCACGACA AGATCGTCCG CTCCACCCAT GGGGTGAACT GCACCGGCTC GTGCTCGTGG AAGATCTACG TCAAGGGCGG CATCGTAACC TGGGAAACCC AGCAGACCGA CTATCCGCGC ACCCGGCCAG AACTTCCCAA CCATGAACCG CGCGGCTGCG CCCGCGGGGC CAGCTACAGC TGGTACCTCT ATTCGGCCAA CCGGGTGAAA TACCCGCTGA TCCGTTCGCG GCTGCTGAAG CTGTGGCGGG CGGCGCGGGT CACGCGCACG CCGGTGGCGG CCTGGGCCTC GATCCAGAAC GACCCGGCCC AGCGGTCTGA CTATGTGACG CGGCGCGGCG GCGGCGGCTT CATCCGCGCC ACCTGGGACG AGGTCACCGA GATCATCGCC GCCGCCAACG CCCACACCAT CAAGGCCCAT GGCCCCGACC GGGTGTTCGG CTTCTCGCCG ATCCCGGCCA TGTCCATGGT CTCCTACGCC GCCGGCGCCC GTTATCTGCA GCTGATCGGC GGGGTCTGCG GCTCGTTCTA CGACTGGTAC TGCGACCTGC CGCCGGCCTC GCCCCAGACC TGGGGCGAAC AGACCGACGT CGCCGAAAGC GCCGACTGGT TCAATTCCAG CTTCATGATC CTGTGGGGCT CCAACGTCCC GCAGACCCGC ACTCCCGACG CCCACTTCTA TACCGAGGCG CGCTATCGCG GCGCCAAGTC GGTGGTCATC TGCCCCGACT ATTCGGAAGC CTCCAAGTTC TCCGACCTGT GGCTGCCGGT GAAGCAGGGC ACTGACGCGG CCCTGGGCAT GGCGTTCGGC CACGTGATCC TCAAGGAGTT CCACGTCGAC CGCCAGGTCC CGTATTTCCG CGACTATCTG CGCAAATATT CCGATCTGCC GATGCTGGTG CGCCTGGTTC CGCAGGACGG CGCCTATGTG CCCGAGCGGC TGCTGCGCGC CGCCGAGTTC GACCAGGCCC TGGGCGAGAC CAACAATCCC GACTGGAAGA CCGTCGCCCT GGACGACACC ACCGGCCAGG TGGTGGTCCC CAATGGCTCG ATCGGCTTCC GATGGGGCGA GGACGGCAAG TGGAACCTGG AGGAAAAGGA CGGGGCCGGT CACGAGACCA CCCTGCGCCT GGGCCTCAAG GGCGTGCATG ACGACGTCGT CGGCGTGCGC TTCCCCTATT TCGGCGGCGC GGCCACCAAC GGCTTCGCCA TGACCGACCA CCCCGACGTC CTGGTGCGCA ACGTGCCGGT CAAGCGGATG ATGTTGCCCG AGGGCGAGAC CCTGGTGGCC TGCGTCTACG ACCTGTTCCT GGCCAATTAC GGCGTCGACC AGGGCTTTGG CGGCGACCAC ATGCCGGCCG ACTATGACGA CGTCCAGCCC TACTCGCCGG CCTGGGCCGA AGCCATCACC TCGGTGCCGC GCGACCAGAT CCTGGCCGTG GCCCGCGGCT TCGCCGGCAA CGCCGAGAAG ACCGACGGCA AGTCGATGAT CATTATCGGC GCGGCGATGA ACCACTGGTA CCACATGGAC ATGAACTACC GCGCCGCCAT CAACATGCTG GTGATGTGCG GTTGCGTCGG TCAGTCCGGC GGCGGCTGGT CGCACTATGT CGGCCAGGAA AAGCTGCGGC CGCAGACCGG CTGGGCGCCG CTGGCCTTCG CCACCGACTG GATCAAGCCG CCAAGGCAGC AGAACTCGAC CTCGTTCTTC TACGCCCATA GCGACCAGTG GCGCTACGAG ACCGTGGCCA TGGACGAGAT CCTCTCCCCC ACCGCGCCGG ACGGCGTGTG GAACGGCTCG ATGATCGACT TCAACGCCAA GGCCGAACGC ATGGGCTGGC TGCCCTCGGC CCCGGCCCTG AAGACCAACC CGCTGGAGGT GGCCAAGGCG GCCGCCGCCA AGGGCCAGGA CGCCAAGGCC TATACGCTCG ACCAGCTCAA GAGCGGCGAG CTTGAGATGT CGTGCATGGA TCCCGACGAT CCGGCCAACT GGCCGCGCAA CATGTTCGTC TGGCGCTCCA ACCTGCTCGG CTCATCAGGC AAGGGCCACG AGTACTTCCT CAAGCACCTG CTGGGCGCTT CGCACGGGGT GCAGGGCAAA GACCTGGGCG AGACCGGCGG CGTCAAGCCG ACCGAGGTCG CCTGGCACGA CCAGGCGCCG GAAGGAAAGC TCGACCTGCT GGTCACGCTG GACTTCCGCA TGTCGACCAC GGCGGTCTAT TCCGACATCG TCCTGCCGAC CGCCACCTGG TACGAAAAGA ACGATCTCAA CACCTCCGAC ATGCACCCCT TCATCCACCC GCTCAGCGCG GCCGTGGATC CCGGTTGGGA GTCGAAGTCG GACTGGGAGA TCTTCAAGGC CATCGCCAAG ACCTTCTCCA AGGTCGCGCC CGAAGTGCTG GGCGTCGAGC AGGACGTCGT CCTGACCCCG ATCCAGCACG ACAGCGCCGC CGAATTGGCC CAGCCGTTCG ACGTTCGCGA CTGGTCCAAG GGCGAGTGCG AGGCGATCCC CGGCAAGACC ATGCCGCAGA TCACCATCGT CGAGCGTGAC TACCCCAACA CCTACAAGCG CTTCACCGCT CTGGGTCCGC TGCTGGCCAA GACCGGCAAC GGCGGCAAGG GCATTGGCTG GAAGACCGAC CACGAGGTCG ACCTGCTGAA GGCGCTGAAC GGCGAAGTGC TCGAGGAGGG TCAGACCCAG GGCCTGGCGC GCATCGAGAG CGATATCGAC GCCTGCGAGA CCATCCTGAT GCTGGCCCCC GAAACCAATG GCGAGGTGGC GGTCAAGGCC TGGGAGGCGC TCGAAAAGCA GACCGGCCGC GAACACGTCC ACCTGGCCAT GCCCAAGGAA GACGAGAAGA TCCGCTTCCG CGACCTGCTG GCCCAGCCGC GCAAGATCAT CTCCTCGCCG ACCTGGTCGG GCATCGAGAG CGAGAAGGTC TGCTACACGG CCGGCTACAC CAACGTCCAC GAACTGATCC CGTGGCGCAC CCTGACCGGC CGCCAGCAGC TCTACCAGGA TCACCTGTGG ATGCGGGCCT TCGGCGAGGC GCTCTGCGTC TATCGCCCGC CCATCGATCT GAAGACCACC CACGTCATGG GCGCCAAGCC CAATGGCGAG AAGGAGATCG TCCTCAACTT CATCACCCCG CACCAGAAGT GGGGCATCCA CTCCACCTAT AGCGACAACC TGATGATGCT GACGCTCAAC CGCGGGGGTC CCGTGGTCTG GGTGTCGGAG GTGGACGCCA AGAAGGCCGG CCTGGTCGAT AATGACTGGA TCGAGGTCTT CAACGCCAAC GGAGCCCTGA CCGCCCGGGT CGTGGTCTCC CAGCGGATCC GTGAAGGCAC CACCTTCATG TATCACGCCC AGGAAAAGAT CGTGAACACC CCGGGATCGC AGATCACCGG CCTGCGCGGC GGCATCCATA ACTCGTGCAC CCGTACCGTG CTCAAGCCGA CCCACATGAT CGGCGGCTAC GCCCAGCTGG CCTACGGCTT CAACTATTAC GGCACCGTCG GCTCCAACCG CGATGAGTTC GTGGTCGTCC GCAAGATGAC CAAGATCGAC TGGCTCGAAG AAACGCTCGT TCAGAAGGAT TTCGCCCAAT GA
|
Protein sequence | MSHTLDRLAF FTRKTELFSD GHGVMNDDDR TWEQAYRNRW QHDKIVRSTH GVNCTGSCSW KIYVKGGIVT WETQQTDYPR TRPELPNHEP RGCARGASYS WYLYSANRVK YPLIRSRLLK LWRAARVTRT PVAAWASIQN DPAQRSDYVT RRGGGGFIRA TWDEVTEIIA AANAHTIKAH GPDRVFGFSP IPAMSMVSYA AGARYLQLIG GVCGSFYDWY CDLPPASPQT WGEQTDVAES ADWFNSSFMI LWGSNVPQTR TPDAHFYTEA RYRGAKSVVI CPDYSEASKF SDLWLPVKQG TDAALGMAFG HVILKEFHVD RQVPYFRDYL RKYSDLPMLV RLVPQDGAYV PERLLRAAEF DQALGETNNP DWKTVALDDT TGQVVVPNGS IGFRWGEDGK WNLEEKDGAG HETTLRLGLK GVHDDVVGVR FPYFGGAATN GFAMTDHPDV LVRNVPVKRM MLPEGETLVA CVYDLFLANY GVDQGFGGDH MPADYDDVQP YSPAWAEAIT SVPRDQILAV ARGFAGNAEK TDGKSMIIIG AAMNHWYHMD MNYRAAINML VMCGCVGQSG GGWSHYVGQE KLRPQTGWAP LAFATDWIKP PRQQNSTSFF YAHSDQWRYE TVAMDEILSP TAPDGVWNGS MIDFNAKAER MGWLPSAPAL KTNPLEVAKA AAAKGQDAKA YTLDQLKSGE LEMSCMDPDD PANWPRNMFV WRSNLLGSSG KGHEYFLKHL LGASHGVQGK DLGETGGVKP TEVAWHDQAP EGKLDLLVTL DFRMSTTAVY SDIVLPTATW YEKNDLNTSD MHPFIHPLSA AVDPGWESKS DWEIFKAIAK TFSKVAPEVL GVEQDVVLTP IQHDSAAELA QPFDVRDWSK GECEAIPGKT MPQITIVERD YPNTYKRFTA LGPLLAKTGN GGKGIGWKTD HEVDLLKALN GEVLEEGQTQ GLARIESDID ACETILMLAP ETNGEVAVKA WEALEKQTGR EHVHLAMPKE DEKIRFRDLL AQPRKIISSP TWSGIESEKV CYTAGYTNVH ELIPWRTLTG RQQLYQDHLW MRAFGEALCV YRPPIDLKTT HVMGAKPNGE KEIVLNFITP HQKWGIHSTY SDNLMMLTLN RGGPVVWVSE VDAKKAGLVD NDWIEVFNAN GALTARVVVS QRIREGTTFM YHAQEKIVNT PGSQITGLRG GIHNSCTRTV LKPTHMIGGY AQLAYGFNYY GTVGSNRDEF VVVRKMTKID WLEETLVQKD FAQ
|
| |