Gene Caul_3605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3605 
Symbol 
ID5901060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3887437 
End bp3889770 
Gene Length2334 bp 
Protein Length777 aa 
Translation table11 
GC content71% 
IMG OID641564116 
Productaldehyde oxidase and xanthine dehydrogenase molybdopterin binding 
Protein accessionYP_001685230 
Protein GI167647567 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCGCC TGCCGCCCCT GAACGGAAAG ATGGGGGGCG AATCCGGCAG CCTGCTGTCG 
CGCCGCACCT TCATCATCGC CTCGACCGGG GCCGGTCTGG CGTTCGGCTT CTCGGCCCTG
GTCGAGGCGG CGATGGATCC GGCCGCGCCG GGCGGCGTCC CGATGACCGC GACGGGACCC
CGCTTCGAGC CCACCCTGTG GTTCGCCATC GACGGCGAGG GGATCGTCAC CGTCAACATC
ATCCGCGCCG AGATGGGCCA GCACGTCGGC ACCGCCCTGG CCCGCATCCT GGCCGACGAG
CTGGACGTGG CCTGGGACAA GGTCCGCATC GTCCACGTCG ACACCGATCC CAAGTGGGGC
CTGATGGTCA CCGGCGGCAG CTGGTCGGTG TGGCAGACCT TCCCGATCTT CAGCCAGGCC
GGGGCGGCCG GGCGGATCGC CCTGGTCGAG GCCGGCGCCA AGCTGCTCAA CGTCTCGCCG
GCTGGCGGAA CGGCGCGGGG CGGCGTGGTC CATGTCGGCG GCAAGTCGAT CTCGTACGGC
GACATCGTCA AGCGCGGCGG CGTGGCGCGC CAGTTCACGC CCGAGGAACT GGCCAAGCTG
CCGGTCAAGC CGGCGGCCGA GCGCAAGCTG ATCGGCAAGC CGGGCAAGGC CCTCGACATC
GCCGCCAAGA CCAACGGGAC GGCGATCTAC GGCATCGACG CCAAGGTCCC GGGCATGGTG
TTCGCGCGGC CCAAGCTGCC GCCCACTCGC CACGGCGCCA AGGTGGTGTC GGTCAACGAC
AGCGCCGCCA AGAAGATCAA GGGCTACCTG CGCTGCGTGA CCCTGGACGA CCCGTCCGAC
ACCGTGCCGG GCTGGGTGAT GGTGATCGCC GAAAGCTATC CCGCCGCGAT CCGCGCCGCC
GACCTGGTCG AGGTCAAATG GGTCGCCGGC CCGGGCGCGA CCGTCTCGGA AAAGGACCTC
CAGGACCACG CCGCCCGCCT GATCGCCAAG CCCGACGGCG GAGCCCTGCT GGAGAACAAG
ACCGCCGACA CCGCCCCGGC CTTCAAGGCC GCCAAGTCGG TGCTGGAGCA GACCTATACC
GCCGCCACCG TCCTGCACTT CCAGCTCGAG CCCGTGAACG CCCTGGCCTT CGAGAAGGAC
GGGGTGATGG AGATCCACAC CGGCAACCAG TGGCAGAGCC TGATCCTGCC CACCCTGGCC
AAGGCCCTGG GCCGCGCCGA GACCAGCATC GTCATGCGCA CCTACATGCT GGGCGGCGGC
TTCGGGCGCC GGCTGAACGG CGACTACGCC GTGCCCGCCG CCCTGGCGGC CAAGGCGCTG
GGCAAGCCCG TGAAGATGGT CCTGACGCGC GAGGACGACG CGCGCTTCGA CTCGGTGCGG
TCGCCCTCGG TCCAGACCGT GCGCATGGCC TTCGACGGGG CCGGCGAGAT CCTCGCTCAG
GAGCATCACG CCGCCGCCGG CTGGCCCACC CTGGCCAACG CCCCCGCCCT GATGCCCAAG
GGCAAGAACG GCGTGGCCTA CGACCCGTTC GCGATCTCCG GGGCCGACCA CTGGTACGAG
GTCGGCGCCC ACAAGGTTCG CGCCATCAAC AACGACCTGG CCAACGAAAC CTTCCGCCCC
GGCTGGCTGC GCTCGGTGGG ACCCGGCTGG ACCAACTGGG CCAGCGAAAG CTTCATCGAC
GAGGCCGCCC ATCACCTGAA GACCGATCCG GTCGCCCTGC GCCTGAAGCT GCTGACCGGG
GCAGGCGCCA ACGCCGGCGG GGACGCCAGC ACCGTCGGCG GGGCCAAGCG TCAGCACGAA
GTGGTGCGCC GCGCCGCCGA GGCCGCGGGC TGGGGGCAAC CTCTGCCCAA GGACACCGGG
CTTGGCCTGG CGACCAGCTT CGGACAGGAG CGGGACATGC CCACCTGGGT GGCCTGCGTC
GCCCGCGTGC ATGTCGATCG CGCCAGCGGC GTGGTCACGG TCCAGAAGCT GACGGTGGTG
ACCGACGCCG GCACGATCGT CGATCCCGAC GGCGCCCTGG CCCAGACCGA GGGCGCGACG
CTGTGGGGCC TGAGCATGGC CCTGCACGAG GGCACGGTGT TCGAGAACGG CCAGGTCAAG
GACGTCAACC TCGATACCTA CACGCCCCTG CGCATCGCCG ACACGCCCGA GCTCGACATC
CGGTTCGTCG ACAGCGTCGA GGTTCCGGTC GGACTGGGCG AGCCGGCCAC CACGGTGGTG
GCGCCTGCCA TCGCCAACGC CATCTTCCGC GCCGTCGGCG TGCGTTTGCG ACACATCCCG
ATCACGCCCG AGGCCGTGCG GACCGCCTTG GCGGGCGGCG CCTCCAGGCT CTAG
 
Protein sequence
MSRLPPLNGK MGGESGSLLS RRTFIIASTG AGLAFGFSAL VEAAMDPAAP GGVPMTATGP 
RFEPTLWFAI DGEGIVTVNI IRAEMGQHVG TALARILADE LDVAWDKVRI VHVDTDPKWG
LMVTGGSWSV WQTFPIFSQA GAAGRIALVE AGAKLLNVSP AGGTARGGVV HVGGKSISYG
DIVKRGGVAR QFTPEELAKL PVKPAAERKL IGKPGKALDI AAKTNGTAIY GIDAKVPGMV
FARPKLPPTR HGAKVVSVND SAAKKIKGYL RCVTLDDPSD TVPGWVMVIA ESYPAAIRAA
DLVEVKWVAG PGATVSEKDL QDHAARLIAK PDGGALLENK TADTAPAFKA AKSVLEQTYT
AATVLHFQLE PVNALAFEKD GVMEIHTGNQ WQSLILPTLA KALGRAETSI VMRTYMLGGG
FGRRLNGDYA VPAALAAKAL GKPVKMVLTR EDDARFDSVR SPSVQTVRMA FDGAGEILAQ
EHHAAAGWPT LANAPALMPK GKNGVAYDPF AISGADHWYE VGAHKVRAIN NDLANETFRP
GWLRSVGPGW TNWASESFID EAAHHLKTDP VALRLKLLTG AGANAGGDAS TVGGAKRQHE
VVRRAAEAAG WGQPLPKDTG LGLATSFGQE RDMPTWVACV ARVHVDRASG VVTVQKLTVV
TDAGTIVDPD GALAQTEGAT LWGLSMALHE GTVFENGQVK DVNLDTYTPL RIADTPELDI
RFVDSVEVPV GLGEPATTVV APAIANAIFR AVGVRLRHIP ITPEAVRTAL AGGASRL