Gene Caul_5184 details


Gene Information

Locus tag: Caul_5184
Symbol:
ID: 5897422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010335
Strand:
Start bp: 98022
End bp: 102014
Gene Length: 3993 bp
Protein Length: 1330 aa
Translation table: 11
GC content: 69%
IMG OID: 641555287
Product: XRE family transcriptional regulator
Protein accession: YP_001676618
Protein GI: 167621833
COG category:
COG ID:
TIGRFAM ID:
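
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(98022, 102014)   # Start bp, End bp from the record
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths agree with the listed Gene Length (3993 bp) and Protein Length (1330 aa).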


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.219176
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCCGA TTCCGACCTA TCGCGAGCTT GGCCTGCTGA TCGCCCAGCA GCGGTTCAAA 
CAGAAGATGC CCAAGCAGGC TGATCTGGCC GACCGTCTGG GCGTCAAGCA ACAGTCGGTC
AGCCGTTGGG AAGCTGGAAC CCATCGTCCG GGCGTGGACC AACTGCCCGC CCTGGCCACG
GTCCTGGGCG AGGACCTCAG CGAGCTGCGG CGCTTGGCGC GCTATGATGA CCTGCCCGTC
TCTCCCATCC TCGAACCTTT CCCGATCGAT CGTCTCGATC CGGTGACCTT CGAAGCGTTC
GTGGCCTATT TCGCCAAGGC GCTGTACCCG GATGATGATG TGCGCCGGCT CGGGGCCAGC
GGTCACAAGC AAGACGGCGG CGACGTCGTG GTCACCCGGC CCCACGGTTC CGTGCTGATC
CAGTGCAAGC GGGTGGAAAC CTTTGGGCCG GCCGATGTGC GCCGGGCGGT CGAGGCCGCC
GCCGGGTTCG CGGCCCAGGA GAAGGTTCTG GCCTTGAGCC GAACCGCCAG CCCCGCCGCC
GCGGCCGCGG TCGCGGCGGC GGGCTGGGTG CTGTGGGACA AGGACTTCGT CAGTCGGGAA
ATCCGCCACC TTGCGCCGGA GGATCAAGAG CGCCTGGTGG ATATCTTCTT CCGCGGGCAG
CGACCGGCGC TTCTGGGCCG GCCCGAGCCT GGCCCCTGGC GCACGGTGGA GGAGTTTTTC
CGGCCGTTCG AGGATCGCGA CAAGGTGCCC AGCCACAGCT GGACCCTGCT GGGTCGCCAG
GAAGACGTCG ATCGGATCCT CGCACTGCTG GAGAGCGCCC AGACGGGCCT GACCCTGCTG
TCGGCGGCCG GCGGCATGGG CAAGTCGCGC CTGTTGCGCG AGATCGCCGA GCGCGCGGGC
GCCGCCGCGC CCCGGACACG CGTGCGGTTT TTGCTCGGCA CAGAGGACGT CAATCGCGCC
CAGCTGGAAG CACTGGGCGC GGGCGCCAAG CTGCTGATCG TCGATGACGC TCATGACCGC
GATGGTCTGG GCGCGCTGCT GGCCTATGCG GCCGATCCCG ACAACAACAC CCGGCTCTTG
CTGGCCGCCC GGCCCTACGC GATCCAGCGC ATCCGCCGCG AGGCCGCCGT GCACGGCCTG
GCCTGCCAGG AGCATGAGCT CTCGCCGCTC ACCCACCCCC AACTCCTGGA GCTGGCGGGC
GCGGTGCTGA ACAACTTCGG GGCGCCGGCC CATTGGGCCG AGATGATCGT CGAGGCGGCC
GACCGCTCGC CGCTGATCGT GGCGATGGGC GCGCGTATCG TCGCCAGAGA CGCCGTGCCG
CTGGAGCTGG CCAAGAACCA GCCGGCCATG CGCGACTATG TGCTGGGCAA GTTCACCAAG
GTGTTGGCCG GCGACCTGGG CGAAGGGCGC GAGGACGCCT ATCGCCGCGT TTTGGACGTG
CTGGCCCTGG TCCAGCCGTT TCATCCGGAC GATCCCCAGC TTCTAGGCCT GATCGAAATC
CTGACCGGCG TCGAGGCCCA GGAGGCCGAG CGGGCGGTGC GGGCCTTGAT CGAGGGCGGG
GTGGCCTATC CACGCGCGCG CCAATACCGG CTGATGCCCG ACGTCTTGGC CGACTACCTG
ATCGAGAGCC GTTGCCTGGA CGGCACACGC CTGTCGGCGT TCGCCCAGCG GGCCCTGACC
GAAACGCCCC CATCCCTGCT CACCAACCTG ATGGTCAATC TGGGCCGGCT CGATTGGCGG
CGCAGCGCCG GCGACACCAC GCGCAGCACC CTGCTGGAGG CGGCCTGGCG GCGTCTGGAG
AATGTCGAAT ACCACTGGGA CGACCGGGTT GAGGCGGCCA AGGCGGTGGC GATGTTCCAG
CCGCGCCAGG CCCTGGACTA TGTCAGCCGC ATGGCCCGCG CCGGCAAGGC CCTCTCGGCG
CTGCCCGACA TCCTGCGCAA TGTCGCCCTG GTCGAGGATT TCTTCGAGGA GGCCGCCGAA
CTGCTCTGGG AGCTGGCGCG CACCGACGAT CGCGAAACCA ACCCCTTTCC CTCCCACCCG
GCCCGTGTCC TGTCCGACAT GGGCGAGTAC CATCCGCGAA AGCCGGTGGT CTTCTCGCAG
AAGGCCTTGG CGCTGGGCCT GCGGCTGGCC CGCGACGAGA CCCAGTGGGA CGGTCGCTAC
ACCCCCTTCG AGATCCTGCG GCCCTTGATG GCCCTGGAAG GCACCACCCA TCGCAGTGAC
GGGCGCGCCA TCCACTTGAG CAGCTTCTTC GTGGACTACG CCGTGGTCGC GCCGATCCGC
GCGGCGATCG TCGACCTGAT CCTGGAGCTG ATCGTCCACC CCAATCTCAA GATCGCCCAC
CTGGCCGCCG CCGAGCTGGA GCATGTGCTG CGCTACCCGC GCGGCGCATT GGGCGCCGTC
AGCCCCGAAG GCTTGGCCAT CGTGGTCGAA GGCGAGTTCG TCCAGGTTCT GCGCCGCTTG
CGCGACACCG TCCCCCGCGT CGCCCCCACC ACGACGGTCA CCATCTCCCG CACGGTGCGT
TGGATGGCCC GCCATGGCCG CAACGGCGCG GCCGAGGCCG CCCGCGAGGT CCTGGCCGCT
TTTCCCAGCG GGCCGGATTA CGAGCTGCTG ATCGCCTTGG CCGCCGGCCA CGCCATGGAC
GCCGAAGACG AGTTCGGGCC GACGTTCCGC GAAGCCGCCG GCCGCTTCAT CGACAAGGTC
GCCAGCCGGC TGGAAGCGGA AATTCCCGAG CCTGAACCGC GCCGCCAGAA AATCGAGGCG
GTGCTGACCG AAATCATCGC CTCGGCGCTT CCCAGCGCCT TGGAGCATGT TCTGCCTGCT
CGGCTGATGG AGCGCGATCC GGCCTTCGCG GAGGCTCTGG TCGTGGACGC TTTGGCCCGC
CCGGCCTCGG CCACCGCGCG GTTTGCCGGG CATGGCCTTT GGCGGCTCCT CAAGGACGGC
CCAGAGGCGG CGGCCATGGC CTACTTGACC CAGGCCGTGG AGGGGCCGCT TGACCTTCAG
CTGGCCGCCG CCCAGGCCCT GGGCGGGCGC GGTGATCCCA CGCCCGGCGA GGTGGCCCTG
CTGCGCCGAT TGCTGGCCAG CAACGAGCCC GTGGTGGTGT CGGCCGCGAC CCGCGCCTTG
TGGAGCCTGC GCTCGGACGA TCCGCTGCTG GCGTTGGATC TGGTGCTGGC CGCCAATCTC
AGCGAGGACC GCCGGGTGCT CGACGACATG ATGATGGCCC TGTGCGGCGA GCCGCCGGCC
GTGGGCATGA TCGACGCCGA GCGCGCCCAG GCCCTGTTGG ACAAGCTCGA GCCGATTCCG
CGTCTCGAAG GCTACTGGGT CAACAAGGTG CTGGCCGAGC TGTCCTATCG TTTTCCGTTC
GAAACCGCCG AGTTCTTCCG GCGGCGGGTC GAGCGGGCCG CCGCGGCCGA GGAACGCGGC
TTTCGCCCGG CAAACCACGG CCCCTACAGC CACGACGCCT TGCGGTTCAT CGAGACCGAC
GCCGGACCGG AGATCTTCAG CCGGATGTGG GCCTGGCTGC GCGCCAACCA TGCGCTGCGG
GGCCGGTTCT CCTACGCGGC TGGGACCGTG TTCGAGGCGA TGTTCCTGCT CGACGACGCC
TTCGCCGCGC GCGCCTTTGA CGCTCAGATC GCCGACATGA CCCTCGAGGA CATGGAACTG
GCGGCCGGCA TCCTGGCCAA CGGCTCGCCG GACTTCATCT TCGGCCAGGC CGATTTCGTG
GTCCGGTTCA TGAACCGCGT GCAGATGTTG GATCCGACCC AGGTTGAACC CCTGGCGCGG
ACGCTGAGCG CCAGCTCGCG CACGGGGGTG CGCAGCGGCT TGGTCGGCGT CCCGACCGCC
GAGGACGTGT CCGAGCGCGA CCGCTCGCTC GAGATGCTCG CCCGCCTGCC CCGGCTCTCG
CCCGCCCGAG CTGTCTACGA AGACGCGCTG GGCCACGCCC AATGGGGGAT CGAACGCACG
TTGCGGGATG CGGAGGCGTT GGACCAGTCG TGA
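
The GC content field in the record can in principle be recomputed from the gene sequence above. The following is a minimal sketch that assumes the sequence is pasted in as text with the 10-base spacing shown here; how the database itself rounds or treats ambiguous bases is not stated in the record, so `gc_fraction` is only an illustrative helper.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (spaces, newlines) is ignored, so the blocked
    layout used in the record can be pasted in directly.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Tiny illustrative inputs, not taken from the record:
print(gc_fraction("GGCC"))        # all G/C
print(gc_fraction("ATGC ATGC"))   # half G/C, with a space as in the record layout
```

Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a percentage to compare against the listed GC content.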
 
Protein sequence
MKPIPTYREL GLLIAQQRFK QKMPKQADLA DRLGVKQQSV SRWEAGTHRP GVDQLPALAT 
VLGEDLSELR RLARYDDLPV SPILEPFPID RLDPVTFEAF VAYFAKALYP DDDVRRLGAS
GHKQDGGDVV VTRPHGSVLI QCKRVETFGP ADVRRAVEAA AGFAAQEKVL ALSRTASPAA
AAAVAAAGWV LWDKDFVSRE IRHLAPEDQE RLVDIFFRGQ RPALLGRPEP GPWRTVEEFF
RPFEDRDKVP SHSWTLLGRQ EDVDRILALL ESAQTGLTLL SAAGGMGKSR LLREIAERAG
AAAPRTRVRF LLGTEDVNRA QLEALGAGAK LLIVDDAHDR DGLGALLAYA ADPDNNTRLL
LAARPYAIQR IRREAAVHGL ACQEHELSPL THPQLLELAG AVLNNFGAPA HWAEMIVEAA
DRSPLIVAMG ARIVARDAVP LELAKNQPAM RDYVLGKFTK VLAGDLGEGR EDAYRRVLDV
LALVQPFHPD DPQLLGLIEI LTGVEAQEAE RAVRALIEGG VAYPRARQYR LMPDVLADYL
IESRCLDGTR LSAFAQRALT ETPPSLLTNL MVNLGRLDWR RSAGDTTRST LLEAAWRRLE
NVEYHWDDRV EAAKAVAMFQ PRQALDYVSR MARAGKALSA LPDILRNVAL VEDFFEEAAE
LLWELARTDD RETNPFPSHP ARVLSDMGEY HPRKPVVFSQ KALALGLRLA RDETQWDGRY
TPFEILRPLM ALEGTTHRSD GRAIHLSSFF VDYAVVAPIR AAIVDLILEL IVHPNLKIAH
LAAAELEHVL RYPRGALGAV SPEGLAIVVE GEFVQVLRRL RDTVPRVAPT TTVTISRTVR
WMARHGRNGA AEAAREVLAA FPSGPDYELL IALAAGHAMD AEDEFGPTFR EAAGRFIDKV
ASRLEAEIPE PEPRRQKIEA VLTEIIASAL PSALEHVLPA RLMERDPAFA EALVVDALAR
PASATARFAG HGLWRLLKDG PEAAAMAYLT QAVEGPLDLQ LAAAQALGGR GDPTPGEVAL
LRRLLASNEP VVVSAATRAL WSLRSDDPLL ALDLVLAANL SEDRRVLDDM MMALCGEPPA
VGMIDAERAQ ALLDKLEPIP RLEGYWVNKV LAELSYRFPF ETAEFFRRRV ERAAAAEERG
FRPANHGPYS HDALRFIETD AGPEIFSRMW AWLRANHALR GRFSYAAGTV FEAMFLLDDA
FAARAFDAQI ADMTLEDMEL AAGILANGSP DFIFGQADFV VRFMNRVQML DPTQVEPLAR
TLSASSRTGV RSGLVGVPTA EDVSERDRSL EMLARLPRLS PARAVYEDAL GHAQWGIERT
LRDAEALDQS
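
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a minimal, dependency-free illustration of that relationship. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it only translates the first and last rows of the gene sequence as listed above. `CODON_TABLE` and `translate` are illustrative names, not a database or library API.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; the amino-acid string follows the
# conventional layout of the standard genetic code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA sequence in frame 1, ignoring whitespace.

    If to_stop is True, translation ends at the first stop codon,
    which is not included in the returned protein string.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAAGCCGA TTCCGACCTA TCGCGAGCTT GGCCTGCTGA TCGCCCAGCA GCGGTTCAAA"
last_row = "TTGCGGGATG CGGAGGCGTT GGACCAGTCG TGA"

print(translate(first_row))  # MKPIPTYRELGLLIAQQRFK
print(translate(last_row))   # LRDAEALDQS
```

The first row yields the first 20 residues of the protein sequence, and the last row ends at the TGA stop codon, matching the final residues listed.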