Gene Caul_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1950 
Symbol 
ID5899405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2089984 
End bp2092749 
Gene Length2766 bp 
Protein Length921 aa 
Translation table11 
GC content67% 
IMG OID641562440 
ProductABC transporter related 
Protein accessionYP_001683577 
Protein GI167645914 
COG category[V] Defense mechanisms 
COG ID[COG1131] ABC-type multidrug transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.62728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCCC GACCCGGCCC GGACGCAACG CCTCCGGTCG CCCGGCTCGC GGGGGTCAGT 
CTCGACTATC GGGGCACGCG GGCCCTGGAC AACATCGATC TCGACATCCC GGCGGGCCGG
ATGGTGGCGC TGATCGGCCC CGACGGGGTG GGCAAGTCGA CGCTGTTCTC GCTGATCGCC
GGCGCGCGGG TCATCCCCTC GGGCACGGTG GAGGTGCTCG GCGGCGACAT GGCCGACCGT
CGGCACCGCG AGGCGGTCTG TCCTCGCATC GCCTACATGC CCCAGGGGCT GGGCAAGAAC
CTCTATCCGA CGCTGTCGGT CTTCGAGAAC CTGGACTTCT TCGGCCGGCT GTTTGGCCAG
GACGGGGCGG AGCGTGAGCG CCGGATCACC GACCTGCTGG ACAGCACAGG TCTCGCACCC
TTCAAGGACC GGCCCGCCGG CAAGCTGTCC GGCGGCATGA AGCAGAAGCT GGGCCTGTGC
TGCGCCTTGA TCCACGATCC CGACTTCCTG TTGCTGGACG AGCCGACGAC GGGGGTCGAT
CCGCTGTCGC GGTCGCAGTT TTGGGATCTC ATCGACCGCA TCCGCGCCGC GCGCCCCGGC
ATGAGCGTGT TGGTGGCCAC CGCCTATATG GAGGAGGCTG CGCGGTTCGA TTGGCTTGTG
GCCATGGACG CTGGCAAGGT CCTGGCGACT GGCTCTCCCG ACGCGCTCTA TCAACGCACC
GGCACGACCG ATCTCGAACA GACCTTTATC GCCCTGTTGC CCGAGGAACG CCGGCGTGGG
GCCCAGCCCG TGACCATTCC GCCGCGTCTG CACGCGGGCG AGGGCGACAT CGCCATCGAG
GCGAAGGGCC TGACCATGCG GTTTGGGGAT TTCACCGCCG TCGATCAGGT GGATTTTCAG
ATCGAGCGCG GCGAGATCTT CGGGTTCCTG GGGTCGAACG GCTGCGGCAA GACCACCACG
ATGAAGATGC TGACCGGCTT GCTGCCCGCC ACCGCCGGCG AGGCGCATCT GTTTGGCCGC
CTGGTGAACG CGGGCGACAT CCAGACCCGT CGGCGGGTCG GCTACATGTC CCAGGGCTTC
TCGCTGTATT CGGAACTGAC CGTCGCGCAA AACCTGGAGC TCCACGCGCA GCTGTTCGGG
ATGGAGGCGC GAGACATCCC GGATCGGATG ACGGAGATGG CCGAGCGCTT TGGACTGGCC
GACCTGATGG ACGCGCTACC GGACGCTCTG CCCCTGGGCC AGCGCCAGCG TCTTTCCCTC
GCCGTCGCCA TGATCCACAA GCCGGAAATC CTGATCCTCG ACGAGCCGAC CTCTGGCGTC
GACCCCATCG CCCGCGACCA GTTCTGGCAG AGCATGATCG ACCTGTCGCG GCAGGACGGG
GTGACGATCT TCATCTCCAC CCACTTCATG AACGAGGCCG CGCGCTGTGA TCGCATCTCG
CTGATGCACG CGGGCAAGGT GCTGGTCAGC GATGCCCCGG CGCGCATCGT CGAGCAGCAG
GGCGCCAAGT CCCTGGAAGA GGCTTTCATC AGCTATCTGA AGGCGGCCGA GGGAGACGAC
GCCCCGCCGC CCGCGCCTGC GTCTCCCCCC AGCGACACGG TCGCTCAGCC GCCGCGCCCG
GGGCGCGGCT GGCTGGGCGG CTTTAGTCCT CGGCGGATGA TGAGCTATTC GCGGCGCGAG
GCGCTTGAAC TGCGCCGCGA CCCGATCCGG GCGACCCTGG CCCTGCTGGG CAGTGTGATC
CTGATGTTCA TCATGGGCTA CGGCATCAGC ATGGACGTCA AGGACCTGCC GTTCGCCGTG
CTGGACCGCG ACGGGACGAC CACGAGCCAG GACTACGCGC TCAACATCGC CGGCTCGCCC
TATTTTCTGG AGCGACGGCC GATCTCGGAC TATGGCGAAC TGGATCGGCG GATGCGGTCG
GGCGAACTGA GCGTGGCGAT CGAGATCCCG CCCGGCTTCG CCCGGGACCT GCGCCGGGGG
GACACGGTGG CGCTGGGCGT GTGGATCGAC GGCGCCATGC CCCAGCGCGC CGAAACCGTG
CAGGGCTACA TCCAGGCGCT GCACGCCCAA TGGCTGAGCG ACATGTCCGT TCGAGAGCTT
GGGGTGCGGC CGTCGACCGG CTTGATCAAC ATCGAAACCC GCTATCGCTA CAATCCCGAC
GTCAAGAGCC TGGTGGCGAT CGCGCCCGCG GTCATCCCCG TGCTGCTGCT GCTGTTTCCG
GCCATACTGA CGGCGCTGAG CGTCGTGCGG GAAAAGGAGC TGGGCTCGAT CATCAACTTC
TACGTCACCC CGGTCACGGC ACTGGAATTT CTGCTGGGCA AGCAGGCGCC CTATGTGGCG
CTGGCCATGT TGAACTATGC CCTGCTGGTC GTGCTGTCCC TGTCGACCTT CCGCGTGCCG
ATCACTGGCG ACATCGTGGC CATGACCTTG GGCGCCCTGC TCTATGTCCT GTCCTCGACG
GCGATCGGCT TGCTGTTCTC GATCTTCATG CGCAGCCAGA TCGCCGCGCT GTTCGCCACC
GCCATCGGCA CGATCCTGCC GGCCGTGCAG TTTTCGGGAC TGATCAATCC GGTCTCGTCG
CTGGAGGGAG CCGGGGCGGT CATTGGTCGC GTCTTCCCGA CCAGTCATTT TGTCACCATC
TGCCGCGGCG TGTTCTCCAA GGGACTGGGG TTTCCCGATC TCCAGCCGGA ACTGCTGAGC
CTGGCGGTCA CCTTCCCGCT GCTGCTGGCC GCCTGCGTCA TGTTGCTCAA GAAACAGGAA
CACTAG
 
Protein sequence
MTARPGPDAT PPVARLAGVS LDYRGTRALD NIDLDIPAGR MVALIGPDGV GKSTLFSLIA 
GARVIPSGTV EVLGGDMADR RHREAVCPRI AYMPQGLGKN LYPTLSVFEN LDFFGRLFGQ
DGAERERRIT DLLDSTGLAP FKDRPAGKLS GGMKQKLGLC CALIHDPDFL LLDEPTTGVD
PLSRSQFWDL IDRIRAARPG MSVLVATAYM EEAARFDWLV AMDAGKVLAT GSPDALYQRT
GTTDLEQTFI ALLPEERRRG AQPVTIPPRL HAGEGDIAIE AKGLTMRFGD FTAVDQVDFQ
IERGEIFGFL GSNGCGKTTT MKMLTGLLPA TAGEAHLFGR LVNAGDIQTR RRVGYMSQGF
SLYSELTVAQ NLELHAQLFG MEARDIPDRM TEMAERFGLA DLMDALPDAL PLGQRQRLSL
AVAMIHKPEI LILDEPTSGV DPIARDQFWQ SMIDLSRQDG VTIFISTHFM NEAARCDRIS
LMHAGKVLVS DAPARIVEQQ GAKSLEEAFI SYLKAAEGDD APPPAPASPP SDTVAQPPRP
GRGWLGGFSP RRMMSYSRRE ALELRRDPIR ATLALLGSVI LMFIMGYGIS MDVKDLPFAV
LDRDGTTTSQ DYALNIAGSP YFLERRPISD YGELDRRMRS GELSVAIEIP PGFARDLRRG
DTVALGVWID GAMPQRAETV QGYIQALHAQ WLSDMSVREL GVRPSTGLIN IETRYRYNPD
VKSLVAIAPA VIPVLLLLFP AILTALSVVR EKELGSIINF YVTPVTALEF LLGKQAPYVA
LAMLNYALLV VLSLSTFRVP ITGDIVAMTL GALLYVLSST AIGLLFSIFM RSQIAALFAT
AIGTILPAVQ FSGLINPVSS LEGAGAVIGR VFPTSHFVTI CRGVFSKGLG FPDLQPELLS
LAVTFPLLLA ACVMLLKKQE H