Gene Caul_0771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0771 
Symbol 
ID5898225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp821575 
End bp822753 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content70% 
IMG OID641561251 
ProductGntR family transcriptional regulator 
Protein accessionYP_001682400 
Protein GI167644737 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.25103 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGTT TGTCGATGCG GAGCGGCGAT ATCAGGCCCT CGCCGGTGCG CGACATGCTC 
AACGTGTCCC AGCGGCCCGG CATGATCTCG TTCGCCGGCG GCCTGCCCGC GCCCGAGACC
TTCGCGGGCC TGGAGCTGCC CCCGCCGCCG CGCGACCTGC TGCAATATGG CCCGACCGAG
GGCGAGCCGG CGCTGCGCGA GCGGATCGCC CAGGACTTGG CCGCCTTGGG CCTGGACACC
GAGCCCGACC GCGTCCTACT GCTGTCCGGG TCGCAGCAGG GCATCGACCT GACCGCCAAA
CTGACCATCG ACGCCGGCAC GCGCCTGGCC GTGGAGTCGC CCGCCTATCT GGCCGCCCTA
CAGGTGTTCC GCTTCTATGG CGCGCGCTTC CAGGTCACCG ACCGCGCCGA CCCGGCGGCG
GGCTGGGCAG AGGGGCCGCC GGCCCTGGCC TATGTGATCC CAACCTTCCA GAACCCGACG
GGCCAGTGCT GGAGCGCCGA CGAGCGCCAG GCGATGGCCC AGGCCTGCCA GGCCCACGAC
GTGATCCTGT TCGAGGACGA CCCCTACCGC GACCTGGTCT ACGAACCTTG CGAGCGCCGG
CCGGCCTGCG CGTGGATGAA GACCGGATCC TGGATCTACC AGGGCTCGTT CTCCAAGACC
GTCGCCCCGG GCCTGCGCCT GGGCTACCTG ACCGCTTCGC GCGACCTGTT CCCCTTTCTG
GTCCAGCTCA AGCAGGCGGC CGACCTGCAC ACCAACAGGC TTAGCCAATG GATGGTGCTG
CAGTATCTGA ACGACCCCGG CCGCGCCGAG CGGATGGCGC GGGTCGCCGA CCTCTATCGC
CGCAAGCGCG GCGTGTTCGC CCAGGCCCTG ACGCGGCATC TGGGCAATAT GGCCTCGTGG
TCGCTGCCGC CGGGCGGGCT GTTCTTCTGG CTGACGTTGA AGGGCGATGT CGGTGTGGAG
GCCCTGCTGA AGAACGCCGT CGAGCGCGGC GTTCTGTTCA CGCCGGGCAG CCATTTCCTG
GCGGAGGGCG GGGCGAGCCC GACAATAAGG CTGAACTTCA GCCTGGCGGA GCCCGAGGCG
GCGGAGCGGG GGCTGGCGAT TTTGGCGGAG CTGCTGCGGG AAGCCGGTGA GCCCTCCCCT
ATCAACCGTC ATTCCCGCCC TTGTGGCGGG AACCCCTGA
 
Protein sequence
MARLSMRSGD IRPSPVRDML NVSQRPGMIS FAGGLPAPET FAGLELPPPP RDLLQYGPTE 
GEPALRERIA QDLAALGLDT EPDRVLLLSG SQQGIDLTAK LTIDAGTRLA VESPAYLAAL
QVFRFYGARF QVTDRADPAA GWAEGPPALA YVIPTFQNPT GQCWSADERQ AMAQACQAHD
VILFEDDPYR DLVYEPCERR PACAWMKTGS WIYQGSFSKT VAPGLRLGYL TASRDLFPFL
VQLKQAADLH TNRLSQWMVL QYLNDPGRAE RMARVADLYR RKRGVFAQAL TRHLGNMASW
SLPPGGLFFW LTLKGDVGVE ALLKNAVERG VLFTPGSHFL AEGGASPTIR LNFSLAEPEA
AERGLAILAE LLREAGEPSP INRHSRPCGG NP