Gene Caul_1581 details

Gene Information

Locus tag: Caul_1581
Symbol:
ID: 5899036
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1668784
End bp: 1670955
Gene Length: 2172 bp
Protein Length: 723 aa
Translation table: 11
GC content: 68%
IMG OID: 641562069
Product: methyltransferase
Protein accession: YP_001683209
Protein GI: 167645546
COG category: [S] Function unknown
COG ID: [COG1262] Uncharacterized conserved protein; [COG4301] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03438] probable methyltransferase; [TIGR03440] conserved hypothetical protein TIGR03440
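
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based inclusive positions and the CDS ends in a stop codon that is not translated. Both points are assumptions, since the record does not state its conventions. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1668784, 1670955)
print(length)                  # 2172
print(protein_length(length))  # 723
```

Both results match the Gene Length and Protein Length fields.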


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.412044
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCCCG ACGATACCCC AGGCCGGGAG GCTTCCGAGG CTCGCGCCAT CGCCTTGGTC 
GATCGCTATC TGGCGGTGCG GCGGCGCACC GAGGCCCTGG CCAAGCCGCT ATCTCCCGAG
GATCAGGGCG CGCAATCGAT GCCCGACGCC AGTCCGGTCA AATGGCACCG GGCTCATACC
GCATGGTTCT TCGAAACCTT CCTGCTGACG CCGTTCCTGC CCGGCTACCA GGTCTTCGAC
CCGGCCTTCG CCTATCTGTT CAATTCCTAC TACGAGGCGG TTGGACCCCG GCAGCCGCGA
CCCTTGCGCG GCCTGATCAC CCGGCCCTCG GCCGACGAGA TCGGGGCCTA TCGCGTCCAT
GTCGACGCGG CCATGGTTCG CCTGTTGACG TCTTCGCCGA CCGCCGAGGT GACGGAGCGC
CTGGACCTGG GTCTGGCCCA TGAGGAGCAG CACCAGGAAC TGATCCTGAT GGACGTGCTC
CACCTGTTCG CCCAGTCGCC GCTGCAGCCG GCCTATGGCG ACAAGCCGCC GCCCCAGCGT
TCCGCCGCCG GCAAGCCGCG TTACCTCGGC TTCGAGGGCG GCCTGGTCGA GATCGGCGAC
GATGGTCCGG GCTTCGCCTT CGACAACGAA CGGCCACGCC ACAGGGTGTT CCTGGAGCCC
TACCGCCTGG CCGATCGGCT GGTGACCAAT GGCGAGTGGT TGGCCTTCGT CGAGGACGGC
GGCTATCGCC GCGCCGATCT GTGGCTGTCC GACGGTTGGG CCGCGGTGAA CGAGCAGGGC
TGGGAGGCGC CGCTCTACTG GCGCCGCGAG GCGGGCGAGA CCTGGTCGGT GATGACCCTT
TCGGGCCGAC GTCCGGTCGA TCCCGACGCG CCGGTGGCTC ATGTCAGCTA CTACGAGGCC
GCCGCCTTCG CCGCCTGGTC AGGGCGCCGC TTGCCGACCG AGGCCGAGTG GGAGGCCGCG
GTCGCCGCCC CGGAAGGCCA AGGTTTGCGC CAGACGTCCG ACGAGGCTTG GCAATGGACC
GCCAGTCCCT ACGTGGCCTA TCCGGGCTTC AAGTCGGGGG TGGGGGCGTT GGGCGAGTAC
AATGGCAAGT TCATGATCAA CCAGATGGTT CTGCGCGGTG GCGCGGCCCA GACTCCGCCC
GGCCACACGC GGCCGTCCTA TCGCAATTTC TTCCACCCGG CCCAGCGCTG GGCGTTCACG
GGGGTCCGCT TGGCCGACGA CATGAGTGCG CTGGATCGCG AGACGGCGAC GCCCGTTTCG
ACCTTCCTCG ACGACGCGGT GGCGGGCCTG ACGGCCGAGC GCAAGACGCT GCCGGCCAAG
TACTTCTACG ACGCCGAGGG CTCGCGCCTG TTCGAGGCGA TCTGCGAACT ACCGGAATAT
TATCCCACCC GCACCGAAAC GGCGCTGCTG CGACGGATCG CTCCCGAGAT CGCCGCCCGC
GTCTGCGACG GCGCCGCCCT GGTCGAATTT GGCAGCGGGG CCAGCACCAA GACCCGAATC
CTGCTGGACG CCATGCCGCA ATTGGCGGTC TACGCGCCAA TCGACATCAG CCAGTCGGCG
CTGGACGAGG CCAGCGAGGC GATCCGGCGC GACTATCCAG CCCTGATCGT CGCGCCGCTG
CTCGAAGATT TCACCCGCGC CTTCCGGCTT CCCGCCGCGG CGCGCGGTCG ACCGGTGACC
GGCTTCTTCC CCGGCTCGAC CATCGGCAAT TTCGCGCCCG CCGACGCCGA GGACTTCCTG
CGTGGGGCGC ACGCTCTGCT GGGCGACGGC GCGATGTTCG TTGTTGGAGT CGATATCGCC
AAGGGCCCCG ACGTGCTGGT GCCGGCCTAT GACGACGCCC AGGGCGTGAC GGCGGCGTTC
AACAAGAACG TGCTGGCGCG GATCAATGGC GAGTTGGGCG GTGACTTCGA CCTCGATGCG
TTCGATCATC GCGCGATCTG GAACGCCGAC GAAAGCCGCA TGGAAATGCA TCTGGTCAGC
CGCGTCGAAC AGACGGCGCA TCTGGCGGGG CATGAGATCC GGTTCGCGGC CGGAGAAACG
ATCCACACCG AGAATTCGTA TAAGTACGCC CCGGAAGTGT TCGTGGAACT GGCCCGCCGG
GCGGGCTGGA AGGTCGCGGC GCGCTGGATC AGCGACAGCC CGAGCTTCGG CGTCTTCGCC
CTAGCCGGCT GA
 
Protein sequence
MSPDDTPGRE ASEARAIALV DRYLAVRRRT EALAKPLSPE DQGAQSMPDA SPVKWHRAHT 
AWFFETFLLT PFLPGYQVFD PAFAYLFNSY YEAVGPRQPR PLRGLITRPS ADEIGAYRVH
VDAAMVRLLT SSPTAEVTER LDLGLAHEEQ HQELILMDVL HLFAQSPLQP AYGDKPPPQR
SAAGKPRYLG FEGGLVEIGD DGPGFAFDNE RPRHRVFLEP YRLADRLVTN GEWLAFVEDG
GYRRADLWLS DGWAAVNEQG WEAPLYWRRE AGETWSVMTL SGRRPVDPDA PVAHVSYYEA
AAFAAWSGRR LPTEAEWEAA VAAPEGQGLR QTSDEAWQWT ASPYVAYPGF KSGVGALGEY
NGKFMINQMV LRGGAAQTPP GHTRPSYRNF FHPAQRWAFT GVRLADDMSA LDRETATPVS
TFLDDAVAGL TAERKTLPAK YFYDAEGSRL FEAICELPEY YPTRTETALL RRIAPEIAAR
VCDGAALVEF GSGASTKTRI LLDAMPQLAV YAPIDISQSA LDEASEAIRR DYPALIVAPL
LEDFTRAFRL PAAARGRPVT GFFPGSTIGN FAPADAEDFL RGAHALLGDG AMFVVGVDIA
KGPDVLVPAY DDAQGVTAAF NKNVLARING ELGGDFDLDA FDHRAIWNAD ESRMEMHLVS
RVEQTAHLAG HEIRFAAGET IHTENSYKYA PEVFVELARR AGWKVAARWI SDSPSFGVFA
LAG
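
The sequences in this record are printed in space-separated blocks of 10 characters. The sketch below joins such blocks and computes a GC fraction of the kind reported in the GC content field, assuming plain ASCII input. The function names are hypothetical, and only the first two blocks of the gene sequence are used so the example stays short:

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First two 10-base blocks of the gene sequence in this record.
prefix = join_blocks("ATGAGTCCCG ACGATACCCC")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.6
```

Applying the same two functions to the full gene sequence would give the whole-gene value to compare against the GC content field.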