Gene Caul_3864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3864 
Symbol 
ID5901326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4181055 
End bp4184786 
Gene Length3732 bp 
Protein Length1243 aa 
Translation table11 
GC content66% 
IMG OID641564386 
Productnitrate reductase, alpha subunit 
Protein accessionYP_001685488 
Protein GI167647825 
COG category[C] Energy production and conversion 
COG ID[COG5013] Nitrate reductase alpha subunit 
TIGRFAM ID[TIGR01580] respiratory nitrate reductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCATA CCCTGGACCG CCTCGCGTTC TTCACGCGCA AGACCGAGCT GTTCTCCGAC 
GGCCACGGCG TCATGAACGA TGACGACCGC ACCTGGGAGC AGGCCTACCG AAACCGCTGG
CAGCACGACA AGATCGTCCG CTCCACCCAT GGGGTGAACT GCACCGGCTC GTGCTCGTGG
AAGATCTACG TCAAGGGCGG CATCGTAACC TGGGAAACCC AGCAGACCGA CTATCCGCGC
ACCCGGCCAG AACTTCCCAA CCATGAACCG CGCGGCTGCG CCCGCGGGGC CAGCTACAGC
TGGTACCTCT ATTCGGCCAA CCGGGTGAAA TACCCGCTGA TCCGTTCGCG GCTGCTGAAG
CTGTGGCGGG CGGCGCGGGT CACGCGCACG CCGGTGGCGG CCTGGGCCTC GATCCAGAAC
GACCCGGCCC AGCGGTCTGA CTATGTGACG CGGCGCGGCG GCGGCGGCTT CATCCGCGCC
ACCTGGGACG AGGTCACCGA GATCATCGCC GCCGCCAACG CCCACACCAT CAAGGCCCAT
GGCCCCGACC GGGTGTTCGG CTTCTCGCCG ATCCCGGCCA TGTCCATGGT CTCCTACGCC
GCCGGCGCCC GTTATCTGCA GCTGATCGGC GGGGTCTGCG GCTCGTTCTA CGACTGGTAC
TGCGACCTGC CGCCGGCCTC GCCCCAGACC TGGGGCGAAC AGACCGACGT CGCCGAAAGC
GCCGACTGGT TCAATTCCAG CTTCATGATC CTGTGGGGCT CCAACGTCCC GCAGACCCGC
ACTCCCGACG CCCACTTCTA TACCGAGGCG CGCTATCGCG GCGCCAAGTC GGTGGTCATC
TGCCCCGACT ATTCGGAAGC CTCCAAGTTC TCCGACCTGT GGCTGCCGGT GAAGCAGGGC
ACTGACGCGG CCCTGGGCAT GGCGTTCGGC CACGTGATCC TCAAGGAGTT CCACGTCGAC
CGCCAGGTCC CGTATTTCCG CGACTATCTG CGCAAATATT CCGATCTGCC GATGCTGGTG
CGCCTGGTTC CGCAGGACGG CGCCTATGTG CCCGAGCGGC TGCTGCGCGC CGCCGAGTTC
GACCAGGCCC TGGGCGAGAC CAACAATCCC GACTGGAAGA CCGTCGCCCT GGACGACACC
ACCGGCCAGG TGGTGGTCCC CAATGGCTCG ATCGGCTTCC GATGGGGCGA GGACGGCAAG
TGGAACCTGG AGGAAAAGGA CGGGGCCGGT CACGAGACCA CCCTGCGCCT GGGCCTCAAG
GGCGTGCATG ACGACGTCGT CGGCGTGCGC TTCCCCTATT TCGGCGGCGC GGCCACCAAC
GGCTTCGCCA TGACCGACCA CCCCGACGTC CTGGTGCGCA ACGTGCCGGT CAAGCGGATG
ATGTTGCCCG AGGGCGAGAC CCTGGTGGCC TGCGTCTACG ACCTGTTCCT GGCCAATTAC
GGCGTCGACC AGGGCTTTGG CGGCGACCAC ATGCCGGCCG ACTATGACGA CGTCCAGCCC
TACTCGCCGG CCTGGGCCGA AGCCATCACC TCGGTGCCGC GCGACCAGAT CCTGGCCGTG
GCCCGCGGCT TCGCCGGCAA CGCCGAGAAG ACCGACGGCA AGTCGATGAT CATTATCGGC
GCGGCGATGA ACCACTGGTA CCACATGGAC ATGAACTACC GCGCCGCCAT CAACATGCTG
GTGATGTGCG GTTGCGTCGG TCAGTCCGGC GGCGGCTGGT CGCACTATGT CGGCCAGGAA
AAGCTGCGGC CGCAGACCGG CTGGGCGCCG CTGGCCTTCG CCACCGACTG GATCAAGCCG
CCAAGGCAGC AGAACTCGAC CTCGTTCTTC TACGCCCATA GCGACCAGTG GCGCTACGAG
ACCGTGGCCA TGGACGAGAT CCTCTCCCCC ACCGCGCCGG ACGGCGTGTG GAACGGCTCG
ATGATCGACT TCAACGCCAA GGCCGAACGC ATGGGCTGGC TGCCCTCGGC CCCGGCCCTG
AAGACCAACC CGCTGGAGGT GGCCAAGGCG GCCGCCGCCA AGGGCCAGGA CGCCAAGGCC
TATACGCTCG ACCAGCTCAA GAGCGGCGAG CTTGAGATGT CGTGCATGGA TCCCGACGAT
CCGGCCAACT GGCCGCGCAA CATGTTCGTC TGGCGCTCCA ACCTGCTCGG CTCATCAGGC
AAGGGCCACG AGTACTTCCT CAAGCACCTG CTGGGCGCTT CGCACGGGGT GCAGGGCAAA
GACCTGGGCG AGACCGGCGG CGTCAAGCCG ACCGAGGTCG CCTGGCACGA CCAGGCGCCG
GAAGGAAAGC TCGACCTGCT GGTCACGCTG GACTTCCGCA TGTCGACCAC GGCGGTCTAT
TCCGACATCG TCCTGCCGAC CGCCACCTGG TACGAAAAGA ACGATCTCAA CACCTCCGAC
ATGCACCCCT TCATCCACCC GCTCAGCGCG GCCGTGGATC CCGGTTGGGA GTCGAAGTCG
GACTGGGAGA TCTTCAAGGC CATCGCCAAG ACCTTCTCCA AGGTCGCGCC CGAAGTGCTG
GGCGTCGAGC AGGACGTCGT CCTGACCCCG ATCCAGCACG ACAGCGCCGC CGAATTGGCC
CAGCCGTTCG ACGTTCGCGA CTGGTCCAAG GGCGAGTGCG AGGCGATCCC CGGCAAGACC
ATGCCGCAGA TCACCATCGT CGAGCGTGAC TACCCCAACA CCTACAAGCG CTTCACCGCT
CTGGGTCCGC TGCTGGCCAA GACCGGCAAC GGCGGCAAGG GCATTGGCTG GAAGACCGAC
CACGAGGTCG ACCTGCTGAA GGCGCTGAAC GGCGAAGTGC TCGAGGAGGG TCAGACCCAG
GGCCTGGCGC GCATCGAGAG CGATATCGAC GCCTGCGAGA CCATCCTGAT GCTGGCCCCC
GAAACCAATG GCGAGGTGGC GGTCAAGGCC TGGGAGGCGC TCGAAAAGCA GACCGGCCGC
GAACACGTCC ACCTGGCCAT GCCCAAGGAA GACGAGAAGA TCCGCTTCCG CGACCTGCTG
GCCCAGCCGC GCAAGATCAT CTCCTCGCCG ACCTGGTCGG GCATCGAGAG CGAGAAGGTC
TGCTACACGG CCGGCTACAC CAACGTCCAC GAACTGATCC CGTGGCGCAC CCTGACCGGC
CGCCAGCAGC TCTACCAGGA TCACCTGTGG ATGCGGGCCT TCGGCGAGGC GCTCTGCGTC
TATCGCCCGC CCATCGATCT GAAGACCACC CACGTCATGG GCGCCAAGCC CAATGGCGAG
AAGGAGATCG TCCTCAACTT CATCACCCCG CACCAGAAGT GGGGCATCCA CTCCACCTAT
AGCGACAACC TGATGATGCT GACGCTCAAC CGCGGGGGTC CCGTGGTCTG GGTGTCGGAG
GTGGACGCCA AGAAGGCCGG CCTGGTCGAT AATGACTGGA TCGAGGTCTT CAACGCCAAC
GGAGCCCTGA CCGCCCGGGT CGTGGTCTCC CAGCGGATCC GTGAAGGCAC CACCTTCATG
TATCACGCCC AGGAAAAGAT CGTGAACACC CCGGGATCGC AGATCACCGG CCTGCGCGGC
GGCATCCATA ACTCGTGCAC CCGTACCGTG CTCAAGCCGA CCCACATGAT CGGCGGCTAC
GCCCAGCTGG CCTACGGCTT CAACTATTAC GGCACCGTCG GCTCCAACCG CGATGAGTTC
GTGGTCGTCC GCAAGATGAC CAAGATCGAC TGGCTCGAAG AAACGCTCGT TCAGAAGGAT
TTCGCCCAAT GA
 
Protein sequence
MSHTLDRLAF FTRKTELFSD GHGVMNDDDR TWEQAYRNRW QHDKIVRSTH GVNCTGSCSW 
KIYVKGGIVT WETQQTDYPR TRPELPNHEP RGCARGASYS WYLYSANRVK YPLIRSRLLK
LWRAARVTRT PVAAWASIQN DPAQRSDYVT RRGGGGFIRA TWDEVTEIIA AANAHTIKAH
GPDRVFGFSP IPAMSMVSYA AGARYLQLIG GVCGSFYDWY CDLPPASPQT WGEQTDVAES
ADWFNSSFMI LWGSNVPQTR TPDAHFYTEA RYRGAKSVVI CPDYSEASKF SDLWLPVKQG
TDAALGMAFG HVILKEFHVD RQVPYFRDYL RKYSDLPMLV RLVPQDGAYV PERLLRAAEF
DQALGETNNP DWKTVALDDT TGQVVVPNGS IGFRWGEDGK WNLEEKDGAG HETTLRLGLK
GVHDDVVGVR FPYFGGAATN GFAMTDHPDV LVRNVPVKRM MLPEGETLVA CVYDLFLANY
GVDQGFGGDH MPADYDDVQP YSPAWAEAIT SVPRDQILAV ARGFAGNAEK TDGKSMIIIG
AAMNHWYHMD MNYRAAINML VMCGCVGQSG GGWSHYVGQE KLRPQTGWAP LAFATDWIKP
PRQQNSTSFF YAHSDQWRYE TVAMDEILSP TAPDGVWNGS MIDFNAKAER MGWLPSAPAL
KTNPLEVAKA AAAKGQDAKA YTLDQLKSGE LEMSCMDPDD PANWPRNMFV WRSNLLGSSG
KGHEYFLKHL LGASHGVQGK DLGETGGVKP TEVAWHDQAP EGKLDLLVTL DFRMSTTAVY
SDIVLPTATW YEKNDLNTSD MHPFIHPLSA AVDPGWESKS DWEIFKAIAK TFSKVAPEVL
GVEQDVVLTP IQHDSAAELA QPFDVRDWSK GECEAIPGKT MPQITIVERD YPNTYKRFTA
LGPLLAKTGN GGKGIGWKTD HEVDLLKALN GEVLEEGQTQ GLARIESDID ACETILMLAP
ETNGEVAVKA WEALEKQTGR EHVHLAMPKE DEKIRFRDLL AQPRKIISSP TWSGIESEKV
CYTAGYTNVH ELIPWRTLTG RQQLYQDHLW MRAFGEALCV YRPPIDLKTT HVMGAKPNGE
KEIVLNFITP HQKWGIHSTY SDNLMMLTLN RGGPVVWVSE VDAKKAGLVD NDWIEVFNAN
GALTARVVVS QRIREGTTFM YHAQEKIVNT PGSQITGLRG GIHNSCTRTV LKPTHMIGGY
AQLAYGFNYY GTVGSNRDEF VVVRKMTKID WLEETLVQKD FAQ