Gene Caul_3564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3564 
Symbol 
ID5901019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3848929 
End bp3850044 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content69% 
IMG OID641564072 
Producthypothetical protein 
Protein accessionYP_001685189 
Protein GI167647526 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0767] ABC-type transport system involved in resistance to organic solvents, permease component 
TIGRFAM ID[TIGR00056] conserved hypothetical integral membrane protein 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.777285 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCAC CGGCTGACTT CACCTTCGAG GATCATGAGG GCCGCAAGAC CGTCATGCTC 
TCGGGGGACT GGACCGCCCG CGGCATGGTC GATGCCGGGG AGCGCCTGAT CACGGCCCTG
GACGGCTCGG ACGCCGTCGA TCTGGACCTG CGCGACCTCA GCCGCTGCGA CACCGCCGGC
GCCTACGCCA TCATCCGCGC CGCCGACGGC CGGGTCAGCG CCGGCCACAT CAAGGCCAAC
AGCCGAACCC TGCGCCTGCT GCAACTGGTC GGCGACGCCA TCCAGGTCGA GCCCGAGGCC
GCGCCGCCGC AGAAGGGCTT CCAGGCCCTG CTGGAGCGTA TCGGCCGCGG CGTCTACGGG
CTGGGCGACG ACCTCTACGG CACCCTGGGG TTTCTGGGGC ACCTGCTGGT GGCCATCGGC
CGCTGCATCG CCAAGCCCAG CCGCATCCGC TGGGCGCCGG TGGTCGCGCT GGCCGAGCGG
TCGGGGCTGG ACGCCATCCC GATCGTGGCC GTGACCACCT TCTTCATCGG CGCGGTCGTG
GCCCTGCTGG GCGCCAACCT GCTGACCCAG TTCGGGGCCC AGGTGTTCGC CGTCGAACTG
ATCGGCATCT CGGTGCTGCG GGAGTTCAAC ATCCTGATCA CCGCCATCCT GCTGGCCGGC
CGCTCGGCCT CCAGCTTCGC GGCCGAGATC GGCTCGATGA AGATGAACCA GGAAATCGAC
GCCATGCAGG TGATGGGGGT CGATCCCTAC GAAGCCCTGG TCCTGCCGCG CTTCGCCGCC
CTGCTGATCA CCATTCCCCT GTTGACCTTC ATCGCCACCC TGGCGGGCCT GGCGGGCGGC
ATGCTGGTCA CCTGGGCGGT GCTCGACCTG TCGCCGACCT TCTTCCTGCA GCGGATGCAG
GACTCCGTCG GCGTGCAGCA CTACTGGATC GGCCTGTCGA AGGCCCCGGT GATGGCCATG
GTCATCGCCG CCATCGGCTG CCGCCAGGGC ATGGAGGTCG GCAACGACGT CGAATCCCTG
GGCCGCCGCG TCACCGCCGC CGTGGTCCAC GCCATCTTCG CGATCATCGC CATCGATGCG
GTCTTCGCCC TGATCTACAT GGAGCTGGAC CTGTGA
 
Protein sequence
MGAPADFTFE DHEGRKTVML SGDWTARGMV DAGERLITAL DGSDAVDLDL RDLSRCDTAG 
AYAIIRAADG RVSAGHIKAN SRTLRLLQLV GDAIQVEPEA APPQKGFQAL LERIGRGVYG
LGDDLYGTLG FLGHLLVAIG RCIAKPSRIR WAPVVALAER SGLDAIPIVA VTTFFIGAVV
ALLGANLLTQ FGAQVFAVEL IGISVLREFN ILITAILLAG RSASSFAAEI GSMKMNQEID
AMQVMGVDPY EALVLPRFAA LLITIPLLTF IATLAGLAGG MLVTWAVLDL SPTFFLQRMQ
DSVGVQHYWI GLSKAPVMAM VIAAIGCRQG MEVGNDVESL GRRVTAAVVH AIFAIIAIDA
VFALIYMELD L