Gene Caul_1016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1016 
Symbol 
ID5898471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1075801 
End bp1077168 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content70% 
IMG OID641561498 
Producttwo component, sigma54 specific, Fis family transcriptional regulator 
Protein accessionYP_001682644 
Protein GI167644981 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.963955 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTCC TGGTCGTCGG AAAACTGAAC GGACAGCTCT CGGTCGCCGT GAAGATGGCG 
ATGAACACCG GGGCCAAGGT CTCGCATGTC GAGACCATCG AGGCGGCGAC CCACGCCCTG
CGGGCCGGGC AGGGGGCCGA CCTGCTGATG GTCGACTACG CGCTCGACAT CGCGGGCCTG
ATCGCCGCCA ACGAGTCCGA GCGGATCCGG GTGCCGGTGG TGGCCTGCGG CGTCGACGCC
GACCCCATGC GGGCCGCCGC GGCGATCAAG GCCGGGGCCA AGGAATTCAT CCCGCTGCCG
CCGGACGCCG AGCTGATCGC CGCCGTCCTG GCCGCCGTGA CCGACGACAA CCGTCCGATG
ATCGTCCGCG ATCCGGCCAT GGGCGACGTC ATCCGCCTGG CCGACCAAGT CGCGGGGTCG
GAAGCCTCGA TCCTGATCAC CGGCGAAAGC GGCTCGGGCA AGGAGGTCAT GGCCCGCTAC
GTCCACGCCA AGTCGCGTCG GGCCAAGGCG CCGTTCATCT CGGTCAACTG CGCCGCCATC
CCCGAGAACC TGCTGGAAAG CGAGCTGTTC GGCCACGAGA AGGGCGCCTT CACCGGCGCC
GTGGCCCGCC GCATCGGCAA GTTCGAGGAA GCCAATGGCG GCACGCTACT GCTGGACGAA
ATCAGCGAGA TGGACACCCG CCTGCAGGCC AAGCTGCTGC GCGCCATCCA GGAGCGCGAG
ATCGACCGGG TCGGCGGCTC AAAGCCGGTC AAGGTCGATA TCCGCATCCT GGCCACCTCC
AACCGCGACC TGACACAGGC GGTGAAGGAC GGCACGTTCC GCGAGGACCT GCTCTACCGT
CTCAACGTCG TGAACCTGCG CCTGCCGCCG CTGCGCGACC GCCCGGGCGA CGTCATCACC
CTGTGCGAGC ACTTCGTGAA GAAGTACTCG GCCGCCAACG GCTTGCCGGA AAAGCCGATC
GCGGCCGAGG CCAAGCGCCG GCTGATCGCC CACCGCTGGC CGGGCAACGT CCGCGAGCTG
GAGAACGCCA TGCACCGCGC GGTGCTGCTG TCGCCGGGCG CCGAGATCGA GGAGTTCGCC
ATCCGCCTGC CGGACGGCCA GCCCCTGGCC CCGGCCCCGG ACGTCGCGGT GGCCCGCGGC
GCCCAGATGG CGGCCGACGC CGTCTCGCGC ACCTTCGTCG GCTCGACCGT GGCCGAGGTC
GAGCAGCATC TCATCATCGA AACGTTGGAG CACTGCCTGG GCAACCGCAC CCACGCGGCC
AACATCCTGG GCATCTCGAT CCGCACCCTG CGCAACAAGC TGAAGGAATA TTCCGAAGCC
GGCGTCGCCG TGCCCGCCCC GCAAGGCGGC GTGACCAACG CGGCCTGA
 
Protein sequence
MRLLVVGKLN GQLSVAVKMA MNTGAKVSHV ETIEAATHAL RAGQGADLLM VDYALDIAGL 
IAANESERIR VPVVACGVDA DPMRAAAAIK AGAKEFIPLP PDAELIAAVL AAVTDDNRPM
IVRDPAMGDV IRLADQVAGS EASILITGES GSGKEVMARY VHAKSRRAKA PFISVNCAAI
PENLLESELF GHEKGAFTGA VARRIGKFEE ANGGTLLLDE ISEMDTRLQA KLLRAIQERE
IDRVGGSKPV KVDIRILATS NRDLTQAVKD GTFREDLLYR LNVVNLRLPP LRDRPGDVIT
LCEHFVKKYS AANGLPEKPI AAEAKRRLIA HRWPGNVREL ENAMHRAVLL SPGAEIEEFA
IRLPDGQPLA PAPDVAVARG AQMAADAVSR TFVGSTVAEV EQHLIIETLE HCLGNRTHAA
NILGISIRTL RNKLKEYSEA GVAVPAPQGG VTNAA