Gene Caul_1417 details

Gene Information

Locus tag: Caul_1417
Symbol:
ID: 5898872
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1505788
End bp: 1507971
Gene Length: 2184 bp
Protein Length: 727 aa
Translation table: 11
GC content: 71%
IMG OID: 641561904
Product: aldehyde oxidase and xanthine dehydrogenase molybdopterin binding
Protein accession: YP_001683045
Protein GI: 167645382
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
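
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the gene ends in a single stop codon, neither of which is stated on this page. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1505788
END_BP = 1507971
GENE_LENGTH_BP = 2184
PROTEIN_LENGTH_AA = 727


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of translated codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # coordinates vs. gene length
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # gene length vs. protein length
```

Under those assumptions the three figures agree: the interval spans 2184 bp, which is 728 codons, one of which is the stop.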


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCGA TCGTTCTCGA CAACCCCTCG CGTCGCCTGT TCCTGAAGAC CGGCGCGGCG 
GCCGGCGGCG GGCTGCTGCT GTCGTTCAAC CTGCCGGGCC TGGTCCGGGC CGAGGGCGAG
GCCGCGGCCA ACCCGTTCAA CGCTTTCGTC CGCATCGCGC CGGACGGCGT CGTGACCATC
GCCGCCAAGC GGCCGGAGAT CGGGCAGGGG ATCAAGACCT CGATGCCGAT GCTGATCGCC
GAGGAGCTGG ACGTCGACTG GTCCAGCGTT CGCATCGAGC AGGCCCCGAT CGACGCCAAG
GTGTTCGGCG AACAGTCGGC CGGCGGCAGC ACCTCCACGC CCGACGACTG GGACCCGATG
CGCCAGGTCG GGGCGGTGGG GCGAGCCTTG CTGATCCAGG CGGCCGCTCA GCGCTGGAAC
ATTCCCGCCG CCGGTCTGGC GACCGAACCC GGCTTCGTGA TCGACAAGGC CGCCAAGCGC
CGCGCCTCCT ACGGTGAGCT GGCCCAGGCC GCCGCCAGCC TGGCCGCGCC CGATCCCAAG
ACCCTGACGC TGAAGGATCC CAAGACCTAC CGCATCATCG GCAAGCCCCA GCCGCAGTCG
GATACGCCGG CGATCGTCAC CGGCCAGCCG CTCTACGGGA TCGATGTGCG CGTGCCGGGC
ATGCTGTACG CCACCTTCCT CAAGGCCCCG GTGTTCGCCG CCAAGGTGGC CAAGGTCGAT
CTGGCGCCGG CCAAGGCGGT CAAGGGCGTG CGTCACGCCT TCGTGGTCGA TGGCGGGACC
GAACTGGAGG GCCTGCTGGG CGGCGTCGTC GTGGTCGCCG ACACCTGGTG GGCGGCGCGC
AAGGGCCGCG ACGCGCTGGA GGTGACCTGG GCCGATCACC CGACCAGCGC CCAGAGCAGC
GCCGGCTTCG TCGCCAAGGC CGCCGAGTTG CACGGCCAGC CGCCGCACCG CAAGGTCAAG
GCGGTCGGCG ACGTCGACGC CGCGCTGCGG GGCGCGGCCA AGCTCGTCCG GGCGGACTAC
AGCTACCCGT TCAACGCCCA CGCCACGCTC GAGCCGCAGA ACTGCACGGC CAGCTTCAAG
GACGGCAAGG TCGAGATCTG GGCCCCGGCC CAGAACCCCG AGAACGGCCG CGAGCTGGTG
GCCAAGACCC TGGGCGTCAA GCCCGACGAC ATCACCATCC ACTTCACCCG CAGCGGTGGC
GGCTTCGGCC GGCGGCTGAT GAACGACTTC ATGGTCGAGG CGGCGTGGAT CTCCAAGGTG
GTCGGGGCGC CGGTCAAGCT GCTGTGGACC CGCGAGGACG ACATGCAGCA CGACTTCTAC
CGGCCGGCCG GCTTCCACCG GCTGACGGCG GGGCTGGACG CCCAGGGCGG GCTGACCGCC
TGGCGCAACC ACTTCGTGAC CTTCGGCGAG GGCGACAAGT TCGTTCGAGC CGCCGGCATG
AACGCCACCC AATTCCCGTT CGGGGCGGTC GATAACTATG CCCTGGACGT CTCGGTCATG
CCGCTGGGCG TGCCGACTGG CTGGCTGCGG GCTCCGGGCA ACAACGCCTA TGGCTTCGTG
CTGCAGGGCT TCACCGACGA GGTGGCCCAT GCGGCGGGCG CCGATCCGGT GGCGTTCCGT
CAAAAGCTGC TGGGCGAGCC GCGGCTGATC GGCGAGCCGG GCAAGGGCGA CAGCTTCCAC
ACGGGCCGCA TGCGCGCCGT GCTGGACCTC GTCGCCGACA AATCCGGCTG GGGCCGCAAG
ACCCCCAAGG GCGTCGGCCT GGGCGTGGCC TGCCACTACA GCCACCGGGG CTATGTGGCG
GTGGTGATGG AGGTCGCGGT CGCCGACGGC AAGCCCCGGG TCCGCAAGGC CTGGGCGGCG
GTCGATGTCG GCCGGCAGAT CGTCAATCCG AACGGCGCCG AGCAGCAGGT GCAGGGCTCG
GTCCTGGATG GGATCAGCCT GGCGCTGGGG CAGGAGATCA CCATCGAGAA CGGGGCTGTG
ACCCAGAGTA ATTTCGGCGA CTTTCCGCTG CTGCGCTTCG CCGACGCGCC GGACGTCGAG
GTGCATTTCG TCACCAGCGA CAACCCTCCC ACCGGCCTTG GCGAGCCCGC CCTGCCGCCG
TCGCCGGCAG CCTTGGTCAA CGCCATCTTC GCGGCGACCG GCAAACGGAT ACGCGCCCTG
CCGATCGGCG ACCAACTGGC CTGA
 
Protein sequence
MTAIVLDNPS RRLFLKTGAA AGGGLLLSFN LPGLVRAEGE AAANPFNAFV RIAPDGVVTI 
AAKRPEIGQG IKTSMPMLIA EELDVDWSSV RIEQAPIDAK VFGEQSAGGS TSTPDDWDPM
RQVGAVGRAL LIQAAAQRWN IPAAGLATEP GFVIDKAAKR RASYGELAQA AASLAAPDPK
TLTLKDPKTY RIIGKPQPQS DTPAIVTGQP LYGIDVRVPG MLYATFLKAP VFAAKVAKVD
LAPAKAVKGV RHAFVVDGGT ELEGLLGGVV VVADTWWAAR KGRDALEVTW ADHPTSAQSS
AGFVAKAAEL HGQPPHRKVK AVGDVDAALR GAAKLVRADY SYPFNAHATL EPQNCTASFK
DGKVEIWAPA QNPENGRELV AKTLGVKPDD ITIHFTRSGG GFGRRLMNDF MVEAAWISKV
VGAPVKLLWT REDDMQHDFY RPAGFHRLTA GLDAQGGLTA WRNHFVTFGE GDKFVRAAGM
NATQFPFGAV DNYALDVSVM PLGVPTGWLR APGNNAYGFV LQGFTDEVAH AAGADPVAFR
QKLLGEPRLI GEPGKGDSFH TGRMRAVLDL VADKSGWGRK TPKGVGLGVA CHYSHRGYVA
VVMEVAVADG KPRVRKAWAA VDVGRQIVNP NGAEQQVQGS VLDGISLALG QEITIENGAV
TQSNFGDFPL LRFADAPDVE VHFVTSDNPP TGLGEPALPP SPAALVNAIF AATGKRIRAL
PIGDQLA
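
The sequences above are printed in blocks of ten characters, so they need the spaces and line breaks removed before use. The sketch below is a minimal, hedged example of doing that, computing a GC percentage, and translating a nucleotide string. It assumes the codon assignments of NCBI translation table 11, which match the standard code for elongation codons (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_percent` and `translate` are hypothetical and are not part of any database tool.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed above.
first_line = "ATGACCGCGA TCGTTCTCGA CAACCCCTCG CGTCGCCTGT TCCTGAAGAC CGGCGCGGCG"
print(translate(first_line))  # MTAIVLDNPSRRLFLKTGAA
```

The printed residues match the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value to compare against the reported GC content.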