Gene Caul_3611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3611 
Symbol 
ID5901066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3894635 
End bp3896641 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content68% 
IMG OID641564122 
Producthypothetical protein 
Protein accessionYP_001685236 
Protein GI167647573 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.431512 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCCTGG CGCTATGGGC GACGTCGGCC CTGACCTCGT CGCCGGCGCT CGCGGCCTCT 
CCGAGGCTCG ACGCCCACGC CATAGCCGCC GCCCGGTTCG GCAACGACGC GCCCTGGTAC
GAGGACAATA TCCCGCTGTT CGAGTCCTCC GATCGCAAGC TCGACGAGAT CTACTATTAC
CGTTGGAGCG TGTTTCGGGC CCACCAGCGC GACCTTGGAC CGCGCGGCTA CATCACCACC
GAGTTCCTGG ACGACGTCGG CTGGCAGCGC GAGCCCTATG CCAGCCTGAA CGACGCCACC
GGCTTCCACA TCCAGGAGGG CCGCTGGCTG CGCGACCGGC GCTATGCCGG CGACTATGTC
GACTTCATGT ACGAGGGCGG CGGCAACGAC CGCCACTTCG CCGAGGCCAT AGCCGACGCC
ACCTTCGCCC GCTTCCTGGT CGATGGCGAC CAGGACGCCG CCACCCGGCA TCTCGGCGCG
ATGAAGCATA TCTACGCCCT GTGGGACGAC CGCTACGACT TCGACAAGGA GCTCTACTGG
ATCGAGCCGC TGCTGGACGC CACCGAATAT ACCATCAGCT CGATCGACGC CTCGGGGGGC
ACGGACGGCT TCAGGGGCGG CCACGCCTTC CGGCCCTCGA TCAACAGCTA CATGTACGCC
AACGCCCGGG CGATCAGCCG GCTGGCGGCC CTGACCGGCG ACACGGCCAC CGCCGCCGAC
TACGCCGCCC GGGCCGACGA CCTGAAGGCC CAGGTGCAGA AAAGCCTGTG GAGCCAGGAC
TTCGCCCACT TCATCGACCG CTACCAGGTC AACAACGAGC ATGTGAAATA CTGGGACCCG
ATCCGCGGCC GCGAACTGGT CGGCTACCTG CCTTGGACCT TCGGCCTGCC CGACGACACG
CCGGCCTACG CCCAGGCCTG GAAGCACGCG GTCGATCCGA ACCAACTGGC CGGCCCGGCG
GGCCTGCGCA CGGTCGAGCC GTCATATGAG CACTACATGC GCCAGTACCG CTACATCAAG
GAAACGGGCG AGCCCGAGTG CCAATGGAAC GGCCCGGTGT GGCCGTTCCA GACGACCCAG
GTGCTGACGG GCCTCGCCAA TCTGCTCAAC GACTATCGCC AGGACGTGGT CACCCGCTCG
GACTACGCCC GGATGCTGGC CCAGTACACG CGGCTGCACT TCAAGGACGG TAGGCCGGAC
CTGCAGGAAG ACTACGACCC GGCGACCGGC AAGGCCATCG TCGGCCTGGC CCGCAGCCAC
CACTACAATC ACTCCGGCTA TGTCGATCTG GTGATCAGCG GCCTGGTCGG CCTGCGCCCG
CGCGCCGACG ACGTGCTGGA GGTCAACCCC CTGGCGCCCA GCGCCCCGGC GGACCCGAAT
TTCCTGAAAT ATTTCCGCCT GCAGGACGCG CCCTATCACG GCCATCTGGT CGGGATTTCC
TGGGACGCCG ACGGTTCGCG CTACGGCCGC CAGGGGCTGG TGGTGACGGT GGATGGCCAG
GAAGTCACCG CCTCGCCGAC CCTAGCCAGA CTGACCATCC CGCTAGCCCG CAAGACGCCC
GCGCCGATCG CCCGGCCGAT CGATCTGGCG GTCAACCTAG TGCGCTCGGA CTATCCGCGC
GGCTCGGCCT CGACGGGGGC CGACGCCAAC ACCGTGCACC AGGCGCTGGA CGGGCGCGTG
TGGTTCTTCC CCGAGATGGC CAACGGCTGG TCGCCGGGCG CCGGGCAGAA GCAACCCTGG
TTCGCCGTGG ATTTCGGCAA GGCGACGTCG GTGCGTTCGG CCGAACTCAG CTTCTTCGCC
GACGACCAGA CGCTGGCCGC GCCGGCGCGC TATCGACTGG AGGCGTGGAA GGACGGCAAG
TGGGTCGAGG TCGCCAAGGT TCCCTCCCCA CTGGCCAACG GCGTCGACCG CGCGGCCTGG
GCGCCGGTTG TGACCATGAA GCTGCGGGCG GTGTTTGAGC TGCCGCCCGG CAAGGACATG
CGGCTGGTCG AGATGAAGGT GTTCTAG
 
Protein sequence
MALALWATSA LTSSPALAAS PRLDAHAIAA ARFGNDAPWY EDNIPLFESS DRKLDEIYYY 
RWSVFRAHQR DLGPRGYITT EFLDDVGWQR EPYASLNDAT GFHIQEGRWL RDRRYAGDYV
DFMYEGGGND RHFAEAIADA TFARFLVDGD QDAATRHLGA MKHIYALWDD RYDFDKELYW
IEPLLDATEY TISSIDASGG TDGFRGGHAF RPSINSYMYA NARAISRLAA LTGDTATAAD
YAARADDLKA QVQKSLWSQD FAHFIDRYQV NNEHVKYWDP IRGRELVGYL PWTFGLPDDT
PAYAQAWKHA VDPNQLAGPA GLRTVEPSYE HYMRQYRYIK ETGEPECQWN GPVWPFQTTQ
VLTGLANLLN DYRQDVVTRS DYARMLAQYT RLHFKDGRPD LQEDYDPATG KAIVGLARSH
HYNHSGYVDL VISGLVGLRP RADDVLEVNP LAPSAPADPN FLKYFRLQDA PYHGHLVGIS
WDADGSRYGR QGLVVTVDGQ EVTASPTLAR LTIPLARKTP APIARPIDLA VNLVRSDYPR
GSASTGADAN TVHQALDGRV WFFPEMANGW SPGAGQKQPW FAVDFGKATS VRSAELSFFA
DDQTLAAPAR YRLEAWKDGK WVEVAKVPSP LANGVDRAAW APVVTMKLRA VFELPPGKDM
RLVEMKVF