Gene Caul_2537 details

Gene Information

Locus tag: Caul_2537
Symbol:
ID: 5899992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2750925
End bp: 2752808
Gene Length: 1884 bp
Protein Length: 627 aa
Translation table: 11
GC content: 68%
IMG OID: 641563028
Product: ABC transporter related
Protein accession: YP_001684162
Protein GI: 167646499
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
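
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes a single untranslated stop codon. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2750925
end_bp = 2752808

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1884, matching "Gene Length"
print(protein_length)  # 627, matching "Protein Length"
```

Under these assumptions the gene length is a whole number of codons (1884 = 3 x 628), and dropping the stop codon gives the 627 aa listed for the protein.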


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.265447
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCAGA TAAACGACCT GACCTTCGAC GCCTGGGGTC GGCGGTTCTT CGACCACGCC 
AGCGTGAGCC TGCCGCCCGG CGCCAAGGTT GGCCTGATCG GCCGCAACGG CGTGGGCAAG
TCCACCCTGT TCAAGCTGAT CCTCGGCCAG CTGCACGCCG GCGACGACGA GATCAGCCTG
CCCAAGGCCG CCCGCATCGG TTCGGTGGAC CAGGAACACC CGGCCACGCC CTTCACCCTG
CTGGAGACCG TGCTGGAGGC CGACGAGGAG CGTCACCAGC TACTGACCAG CCTCGACACC
GCCGAGCCCG AGGAGATGGG CGAGATCTGG GCCCGGCTGA TCGAGATCGA CGCCGACGCG
GCGCCCGCCA AGGCCTCGGA AATCCTGGTC GGCCTGGGCT TTTCCCAGGA AGACCTCTCG
CGGCCGATGA GCGAGTTCTC GGGCGGCTGG CGGATGCGCG TGGCCCTGGC CGCGGCGCTG
TTCGCCGAGC CGGACATGCT GCTGCTGGAC GAACCGACCA ACTATCTCGA CCTGGAAGGC
GCGCTTTGGC TTGAAGCGAG GCTCCAGAAG TACCCGCACA CCGCCCTGAT CGTCAGCCAC
GACCGCGAGT TGCTCAACAA CAGCTGTACG CACATGCTGC ACCTGGCCGG CGGCAAGCTG
GAGCTCTACA CCGGCGGCTA CGACGACTTC GAGCAGCGCC GCGCCGAGAA GGCTCGCCTG
CAGCTATCGG CCAAGGCCAA GCAGGACGCC GAGCGCGCCC ACCTGCAGGC CTTCGTCGAC
CGCTTCAAGG CCAAGGCCTC CAAGGCCGCC CAGGCCCAGT CCCGCATGAA GCGACTGGCC
AAGATGCAGC CGGTGGCCAC AACGATTGAG GAGCGCGTCG CGCCCTTCAC CCTGCCCTCG
CCGCCGCGCC CGCTGGCTCC GCCGCTGATC CGCCTGGAAC GGGCCAATGT CGGCTATGAA
GCCGACCGCC CGATCCTCAA GAACCTCAAC CTGCGCATGG ACCTGGACGA CCGCATCGGC
CTGCTGGGCG TCAACGGCGC GGGCAAGTCG ACCTTCGCCA AGATGATCGC CGGGGCGCTG
AAGATCCAGT CCGGCGAGCT GCACCGCGAC CGCAAGATGA AGGTCGGCTG GTTCCACCAG
CACCAGATCG AGGCGATGGA TCCGGACGAC ACCCCGCTGG AGATCATCCG CCGGGCCATG
CCGGACGCTT CGGAAAGCTC GCGGCGCTCG AAACTGGCGC AGTTTGGCCT GGGCTTCGAG
AAGCAGGAGA CCACGGTCGC CAACCTGTCG GGCGGAGAGC GGGCCCGCCT GCTGCTCAAC
ATGGTGGCGA TGGACGCGCC GCACATGCTG ATCCTCGACG AGCCGACCAA CCACCTGGAC
ATCGACAGCC GCCGGGCCTT GCTGGACGCG CTGAACGACT ACGAGGGGGC GGTGATCCTG
ATCACCCACG ACCGCTCGCT GATGGAGCTG GTCGCCGACC GCCTGTGGCT GGCCGCCGAC
GGCACTGTCA GGCCCTTCGA CGGCGACATG GACGAATACG CCAAGTTCGT GCTCGACCGC
GCCAAGCAGT CGGTCAAGCC GACACAGGTC GACCGCGAGC CGGCCAAGGC CGAAACGCCC
GCCCCAACGG CTTCTGGCGC GCCGAAGAAG ACCGTCGCCA TCGAGCCCCT GAAGCGCAAG
ATCGAGGCCG CCGAGCAGGT CGTCACTCGC ACGACCCGCC AACTCGCCGA GTTGGAGGCC
CAGTTGGCCG ATCCGCAGCT CTACAAGAAC CCGGCCAAGG TCGCCGAACT GACCAAGCGC
CGCGACAACG CCAAGGCCAA GCTGGACGAA GCCGAGGCGA CCTGGATGGG CCTGGCCGAG
GAGTTGGCGG CGGCGGAGGC CTAG
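
The sequence listing is printed in blocks of 10 bases, so the spacing has to be stripped before the sequence can be used. The following is a minimal sketch of that cleanup plus a GC-fraction helper. The function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the listing are used as demo input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-base blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence listing, used only as a short demo.
first_line = clean_sequence(
    "ATGCTCCAGA TAAACGACCT GACCTTCGAC GCCTGGGGTC GGCGGTTCTT CGACCACGCC"
)
last_line = clean_sequence("GAGTTGGCGG CGGCGGAGGC CTAG")

print(len(first_line))               # 60
print(first_line.startswith("ATG"))  # True: the listing opens with a start codon
print(last_line.endswith("TAG"))     # True: the listing ends with a stop codon
```

Applied to the full listing, `clean_sequence` should give a string of 1884 bases (31 lines of 60 plus a final line of 24), matching the Gene Length field, and `gc_fraction` can then be compared against the reported GC content.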
 
Protein sequence
MLQINDLTFD AWGRRFFDHA SVSLPPGAKV GLIGRNGVGK STLFKLILGQ LHAGDDEISL 
PKAARIGSVD QEHPATPFTL LETVLEADEE RHQLLTSLDT AEPEEMGEIW ARLIEIDADA
APAKASEILV GLGFSQEDLS RPMSEFSGGW RMRVALAAAL FAEPDMLLLD EPTNYLDLEG
ALWLEARLQK YPHTALIVSH DRELLNNSCT HMLHLAGGKL ELYTGGYDDF EQRRAEKARL
QLSAKAKQDA ERAHLQAFVD RFKAKASKAA QAQSRMKRLA KMQPVATTIE ERVAPFTLPS
PPRPLAPPLI RLERANVGYE ADRPILKNLN LRMDLDDRIG LLGVNGAGKS TFAKMIAGAL
KIQSGELHRD RKMKVGWFHQ HQIEAMDPDD TPLEIIRRAM PDASESSRRS KLAQFGLGFE
KQETTVANLS GGERARLLLN MVAMDAPHML ILDEPTNHLD IDSRRALLDA LNDYEGAVIL
ITHDRSLMEL VADRLWLAAD GTVRPFDGDM DEYAKFVLDR AKQSVKPTQV DREPAKAETP
APTASGAPKK TVAIEPLKRK IEAAEQVVTR TTRQLAELEA QLADPQLYKN PAKVAELTKR
RDNAKAKLDE AEATWMGLAE ELAAAEA