Gene TM1040_3311 details


Gene Information

Locus tag: TM1040_3311
Symbol:
ID: 4075716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 320075
End bp: 321163
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 63%
IMG OID: 638004819
Product: LacI family transcription regulator
Protein accession: YP_611545
Protein GI: 99078287
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
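
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the record states neither convention explicitly, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(320075, 321163)
print(length)                     # 1089
print(protein_length_aa(length))  # 362
```

Both results match the Gene Length and Protein Length values listed in the record.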


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.472161
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.750741
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACA GGACCCGCAT AACCATCAAG GATGTCGCCC TCGCTGCCGG ATGTGGCGTC 
GCAACTGCGA GTCGCGTGCT GAATAAATCC GGCCCCGCCA GCGCTGAGAC CCGCGCTCGT
GTTGAAGATG CAGCGCGGCG CATGGGTTTT GTCTTTTCCG CGACTGGACG CGCCTTACAA
AGCCGCAAGA GCATGACCGT CGGTTGCCTT ATTCCGTCGC TCGCCAACCC GGTGTTTGCG
GAGGCCGTGC AGGGGGCTCA GGAAGAGTTG CGCAGCCACG GGTATCAGCT GTTGGTCGCC
AGCTCCAATT ACGATGACGA AACGGACAAT GACATTCTCA CAACTCTCCT GAGTAAGGAT
GTCGACGGCC TGTTGGTGAC AATGGCCGCG CCACAGGAGA GTGTGCCGCT CGCACAGGCT
CGGGCGCGCG ACATCCCCGT CTGCCTGATG TTTCACGACC CTCTGCCCGA CTGGCCCAGC
GCGCATGTGT CCAACGCGCA GGCCGCCGCA GAAGTCGCGC GCCAGTTTGC CCTCTACGGC
CACGAGCGCA CCGGATTTCT GGCGTTGCGA TTTTCGACCT CGGACCGCTC ACGCAACCGG
TTTGACGGGT TTCGGGCGCA ATGTGCCGCC CTCAATCTGG CCCCACCCAA GCTGATCGAG
ATCACCGAGA CCGAAGCCAA CACGCCCAGT ATCCTTGCCC AGCGCCTCTC GGACCACCCG
GATCTGACCG CGATCTTTGC CTCCAATGAC TTCCTGGCAA TCGCCGTGCA AAAAGCCGCG
CCTTTGATGG GGCGGCATGT CCCGCAGGAT TTGTCGGTTG TGGGGTTCGA TGGGATCGAG
GTCGGGCGTT TGCTGGATCG CTCGCTTGCC ACGATCGAGA CCACCCCAGA GGCAATGGGT
CGCCAGGCCG CACAGACGCT TTTGACCGGA CTGCAGGGCG GCGCAATGAC AGAACTTGCG
CCCCTGCCTT TTACTTTCCG CGCCGGGGCA ACCTTGTCCG GACCCCGCGC GAAAAGCCCT
GACGACGACC GGGGTGCTGC CCAGTCGCCG TCTGTTCCCC CGTTCACGTC CAACGACAAA
CAAGGATGA
 
Protein sequence
MTDRTRITIK DVALAAGCGV ATASRVLNKS GPASAETRAR VEDAARRMGF VFSATGRALQ 
SRKSMTVGCL IPSLANPVFA EAVQGAQEEL RSHGYQLLVA SSNYDDETDN DILTTLLSKD
VDGLLVTMAA PQESVPLAQA RARDIPVCLM FHDPLPDWPS AHVSNAQAAA EVARQFALYG
HERTGFLALR FSTSDRSRNR FDGFRAQCAA LNLAPPKLIE ITETEANTPS ILAQRLSDHP
DLTAIFASND FLAIAVQKAA PLMGRHVPQD LSVVGFDGIE VGRLLDRSLA TIETTPEAMG
RQAAQTLLTG LQGGAMTELA PLPFTFRAGA TLSGPRAKSP DDDRGAAQSP SVPPFTSNDK
QG
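
The sequences above are printed in space-separated groups of ten characters. A minimal sketch of how such a block could be joined into one string and its GC content computed is given below; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed percentage is not the whole-gene value.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACCGACA GGACCCGCAT AACCATCAAG GATGTCGCCC TCGCTGCCGG ATGTGGCGTC"
seq = clean_sequence(first_line)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence block to the same two functions would give a value to compare against the GC content field in the Gene Information section.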