Gene Caul_1340 details


Gene Information

Locus tag: Caul_1340
Symbol:
ID: 5898795
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1422573
End bp: 1424132
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 60%
IMG OID: 641561827
Product: putative transcriptional regulator
Protein accession: YP_001682968
Protein GI: 167645305
COG category: [K] Transcription
COG ID: [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen
TIGRFAM ID:
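
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1422573
END_BP = 1424132
GENE_LENGTH_BP = 1560
PROTEIN_LENGTH_AA = 519


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1560
    print(expected_protein_length(GENE_LENGTH_BP))  # 519
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length.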


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAAGC GAAGTATCAC GGACGACGAA ATCGCTCTGA TCAAGGCGAT GAAAAAACGT 
GGCATGGCCA ACAAGGACAT CCAGTTCTTT TTCAATCGCC CGGGGAGGGC GGTAAATTCA
GGTCGAATTT CGGATATCGG AAAGGGTACC TACAGCGATT GCGCGACCAT TCAGGCTGCC
TCCGATCTCG AACTGGATGG GTTCCTGTCG TCGTTCGATT CGGCAGGGGT GAGCGCCCAG
ATCCAAGTTC CGGCGGCCTC TTCGATAGAA GCCCCGGAGA CGGGGCCGGT CGATCCGAGG
GCGATCAATC GGCTGTTTGC GAAGGGGCCT GATGGGGTCT GGCGGCTCGC GCATGGCGAG
GCAGACGACC TTGAGTGCAA GGCCAACTTT GGTTTCAAGC ACTCGCACAA TTGGATCAGG
GCGATTGCTG CGCTGGCCAA CAATCGAGGC GGCTACGTGC TGTTCGGTGT CGCCGACAAA
GGGACCATCG GTCCTGCTGG CGAAGATCAT AGCCATGCTG TCGTGGGGCT GGGCTCGAAG
GACTTCGAGA ACGCCGACCC CGCTGACATC ACCAATAAGC TGCGTTCCTG CCTGGACCCG
ACCCCGCGCA TTCAGATCCG TACCCATGTG CTGGCAGGGA TCACGATTGG CGTCATCCAC
GTCGAACAGC ATCCGAGCCG GCCCGTCATC GTCACGAAAA CCGAAGGCGA CAAGATCAAG
GAAGGCGATA TTTTCTTCCG CTATCCGGGG CAGTCCATCC GCATAAAATA TAGCGATCTG
CGAGCCATCC TTGATGGGCG TGATGCGGCG GCACGTCGCG AAATCATGCC GGCCGTCGAG
CGTCTGCTGA CGCTTGGCCC GAGGCGAGCG CTCATCGCGG ATTTGGATGA GGGCACGCTT
GGCAGCGGTG GAAACCTCAT CACGATCGAC CCCGAGTTGG TGAAGCAGAT TCAGTTCGTG
AGGGAAGGGG ACTTCAAGCA GGCGGATGGC GCTGCCGCCC TTCGGCTTGT CGGTGATGTG
GTCACCGCGA CCACGACGGG TAAAATCGAG AGCCGTGGGG TTGTGACCGA CAAAATCCTG
CTTCGGAATT TCCTAGCTCG GGAAACCATC GCGAGCCCGA CGGAGTACGT GCGATTCGCT
ACCGAGGGCG CTCACGCGGC CTGGCTGCCG ATTTTCTATT TCGCGAACGC GGCAAAGCTG
GACCGAGCCG CGTTGGAGGC GTTTGTCGGG AAGCTGGATG GCGCGAAGGC AAAAAAGGAT
AGGCTGCTGA GGCGCATCAA AGGGGATGTC AGTGCATACG AACGATACAC GGGTACGCCC
GCCACTATCC TCAAGAAGCT TATGGCAGGC GACGACGTGA CGCCGACAGA CGCCAAGGGA
GCGATCAACG CCGCCATGGC TGTGTGCGGT CTGAAGGATG GTCACGGGCT TGCGGCCGAC
CAACTCCTTA GCCTTCTAAC AAGCTGCCTC GCCTTGGTGG AGGCTACGCC GACAGCGCAT
GGGCTAAGTC ATGTGCGGCG TGCCGCTTGT CGCCTTGATG AGCTATTTTA TCCGCTGTGA
 
Protein sequence
MAKRSITDDE IALIKAMKKR GMANKDIQFF FNRPGRAVNS GRISDIGKGT YSDCATIQAA 
SDLELDGFLS SFDSAGVSAQ IQVPAASSIE APETGPVDPR AINRLFAKGP DGVWRLAHGE
ADDLECKANF GFKHSHNWIR AIAALANNRG GYVLFGVADK GTIGPAGEDH SHAVVGLGSK
DFENADPADI TNKLRSCLDP TPRIQIRTHV LAGITIGVIH VEQHPSRPVI VTKTEGDKIK
EGDIFFRYPG QSIRIKYSDL RAILDGRDAA ARREIMPAVE RLLTLGPRRA LIADLDEGTL
GSGGNLITID PELVKQIQFV REGDFKQADG AAALRLVGDV VTATTTGKIE SRGVVTDKIL
LRNFLARETI ASPTEYVRFA TEGAHAAWLP IFYFANAAKL DRAALEAFVG KLDGAKAKKD
RLLRRIKGDV SAYERYTGTP ATILKKLMAG DDVTPTDAKG AINAAMAVCG LKDGHGLAAD
QLLSLLTSCL ALVEATPTAH GLSHVRRAAC RLDELFYPL
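
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace (block spaces, newlines) and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record, in its blocked layout.
first_line = "ATGGCCAAGC GAAGTATCAC GGACGACGAA ATCGCTCTGA TCAAGGCGAT GAAAAAACGT"

if __name__ == "__main__":
    seq = clean_sequence(first_line)
    print(len(seq))          # 60 bases: six blocks of ten
    print(seq[:3])           # ATG, the start codon
    print(gc_fraction(seq))  # GC fraction of this sample line only
```

Applying the same two helpers to the full gene sequence should give a length matching the Gene Length field and a GC fraction comparable to the reported GC content, provided the listing was copied without loss.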