Gene TM1040_1299 details


Gene Information

Locus tag: TM1040_1299
Symbol:
ID: 4078498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1390120
End bp: 1392111
Gene Length: 1992 bp
Protein Length: 663 aa
Translation table: 11
GC content: 62%
IMG OID: 638006607
Product: peptidase U35, phage prohead HK97
Protein accession: YP_613294
Protein GI: 99081140
COG category:
COG ID:
TIGRFAM ID: [TIGR01543] phage prohead protease, HK97 family
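
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon. Neither convention is stated in the record; both are assumptions, though the numbers are consistent with them.

```python
# Values copied from the Gene Information fields.
start_bp = 1390120
end_bp = 1392111

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: the final codon is a stop codon and is not counted in the protein length.
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1992 0 663
```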


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.284846
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCCGCG AAGTCACAGC GGAGCAGATC AACGCGCGCG GCGAGGGCGG TGCGATGCGC 
CGCATGGGAG AGGTGCGCGA AATCAACGTC GAGGCGCGCA CTGTCGAGCT TGCCTTCTCG
AGCACGACCC CGGTGCGGCG CTGGTTTGGC GATGAGGTGC TTTCCCATGA CGCCGAGGCC
GTGGTTCTGG ACCGTCTGCT CGACGGCGGG GCAGTATTGG TCGGGCACAA TTGGGACGAT
CAGGTCGGCG TTGTGCAAAG TGCGCGCGTC GATTCCGACG GCGTCGGGCG CGCTGTTGTG
CGGTTCGGCA AGAGCGCCCG CGCGAGCGAG ATCTTTCAGG ACATCGTCGA CGGCATTCGG
CAGCACGTCT CGGTTGGCTA CCGGGTGATC ACGATCAGCG AGGAAATACG GGAAGGTCAG
CCGAACCTTA TCACGGTCAC GCGCTGGGAG CCGTTCGAGA TTTCGGTCGT ACCGGTGCCA
GCTGATCCGA CAGTCGGCAT CGGTCGCGCG CTGGAAAATC CGCCAGAGGC GGGCGGGGCG
GATCGTGGCC AAACTGGCGA AGAGAATGCG GGCGCGGTGG CCGAGCCCAA CGATACAGGA
CAAAGGGAAA CACAGATGAA AACCATCATC ACCCGCGACG CCGAGGGCAA TCTTGTCCGG
GCCAAGGTCG ACGAAAACGA TCAGATCATC GAAGTGATCG AGGTGCTCGA ACGTGCAGGC
GCAGCGGAAA CTGCCATGCT GTCGCGTCTG CAGGAGCAGG AAGCGGCCCG CGTGCGCGAA
CTGACCGAGT TGGGTCGCGA ATACGACGCG CCGGAGCTCG CAACAGAAAT GATTGCCGGG
CGTCATGGCG TGACCGACAT GCGCGAACGC CTGCTTGATC ACCTACACCA GCGCAGCAAT
GAAAGTCGCC AAATTTCTGA GCGTTCCAGC ATTGGCCTGA CCGACAGCGA AACAGAGCAG
TTTTCGTTCC TGCGCGCGAT CCGGGCGCTG GCAAATCCGA CAGACCGCAG CGCCCAGGAA
GCGGCTGCTT TCGAGTTCGA GGTCTCCGAT GCTGCCGCCG AGGCGCAGGG GCGCGATGCG
CAGGGCGTAA TGGTGCCGAT GGACGTGCTG ATGCGTGCGC CGCTGAACAC GGGTTCCGGT
GGCGCGACCG CTGCCGATAC CGGCGGCAAC ACCATCGCAA ACCCGTTGCT GACGCAGAGC
TTTATTCAGA TGCTGCGCAA CCGTACGATC CTTTTGCAGC TCGCGACTCC GCTGATGGGT
CTGGTGGGCA ACCCTGATAT CCCGACGCAG GAAGGTGGTG CGACCGGCTA CTGGATCGGT
GAGGATATCG AGGCGACAGA GGATCTGCTG TCTCTGGGTC AGCGCCAGTT TTCTCCGAAA
ACTGTTGCGG CCTATTCGGA GATCACGCGC CGCACTCTCA AGCAATCCAG TTTGGATATC
GAGGCGCTGG TGCGTAGTGA CCTTGCGCTC GCATTGGCAA CCTCGCTGGA TTTTGCGGGT
TTCTATGGCA CCGGCGCCAA CGATCAGCCG CTGGGTATCG CGAATACGAG CGGCGTGAAT
GTGGTCGACT TTGGCGGCGC AGCGTCTGGT GGCGGATCCG CCCTCCCGAC CTGGGCCGAA
GTGATCCAGA TGGAGAGCGA GATTTCTGCT GCAAATGCCG ATGTGAATAG CATGGCGTAT
GTGCAGAACG CCAAGATGCG TGGTCACTTC AAGAGCACGC AGAAATTCAG CGGCACAAAC
GGTGCGCCCA TCTGGGAGAG CGACAACACC GTGAACGGAT ATCGCGGCGA AGTTACCAAC
CAGATTAAAG ATGGTGACGT GTTCCACGGT GATTTCGCGA ACGTCCTGGT TGGCATGTGG
GGTGGTCTGG ATATTACGGT GGACCCGTAC ACCCACAGCC GCCGCGGGCG CCTGCGCATC
GTGACGATGC AGGACGCGGA TTATGTTCTG CGTCACCCGG CGGGCCTCTG CTTCGGCACT
GACGCCAGCT AA
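
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction like the one reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for illustration; pasting in the full sequence would be needed to compare against the reported 62%.

```python
def clean_sequence(text):
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Return the fraction of G and C bases in a (possibly blocked) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as printed in the record.
first_line = "TTGACCCGCG AAGTCACAGC GGAGCAGATC AACGCGCGCG GCGAGGGCGG TGCGATGCGC"

print(len(clean_sequence(first_line)))
print(round(gc_fraction(first_line), 3))
```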
 
Protein sequence
MTREVTAEQI NARGEGGAMR RMGEVREINV EARTVELAFS STTPVRRWFG DEVLSHDAEA 
VVLDRLLDGG AVLVGHNWDD QVGVVQSARV DSDGVGRAVV RFGKSARASE IFQDIVDGIR
QHVSVGYRVI TISEEIREGQ PNLITVTRWE PFEISVVPVP ADPTVGIGRA LENPPEAGGA
DRGQTGEENA GAVAEPNDTG QRETQMKTII TRDAEGNLVR AKVDENDQII EVIEVLERAG
AAETAMLSRL QEQEAARVRE LTELGREYDA PELATEMIAG RHGVTDMRER LLDHLHQRSN
ESRQISERSS IGLTDSETEQ FSFLRAIRAL ANPTDRSAQE AAAFEFEVSD AAAEAQGRDA
QGVMVPMDVL MRAPLNTGSG GATAADTGGN TIANPLLTQS FIQMLRNRTI LLQLATPLMG
LVGNPDIPTQ EGGATGYWIG EDIEATEDLL SLGQRQFSPK TVAAYSEITR RTLKQSSLDI
EALVRSDLAL ALATSLDFAG FYGTGANDQP LGIANTSGVN VVDFGGAASG GGSALPTWAE
VIQMESEISA ANADVNSMAY VQNAKMRGHF KSTQKFSGTN GAPIWESDNT VNGYRGEVTN
QIKDGDVFHG DFANVLVGMW GGLDITVDPY THSRRGRLRI VTMQDADYVL RHPAGLCFGT
DAS
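
The protein sequence can be related back to the gene sequence through the translation table listed in the record (11, the bacterial, archaeal and plant plastid code). The gene begins with TTG, which ordinarily encodes leucine, yet the protein begins with M; this is expected because table 11 accepts TTG as an initiation codon, and an initiation codon is translated as methionine. The sketch below is a minimal hand-rolled translator, not the database's own method. The function name `translate` and its `first_is_start` parameter are hypothetical, and the start-codon set is the one NCBI lists for table 11. It is applied only to the first line and the last 12 bases of the gene sequence.

```python
from itertools import product

# Amino acids in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, first_is_start=True):
    """Translate DNA up to the first stop codon (hypothetical helper)."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon at position 0 is read as methionine.
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = "TTGACCCGCG AAGTCACAGC GGAGCAGATC AACGCGCGCG GCGAGGGCGG TGCGATGCGC"
tail = "GACGCCAGCT AA"

print(translate(head))                        # MTREVTAEQINARGEGGAMR
print(translate(tail, first_is_start=False))  # DAS
```

Both results agree with the start (MTREVTAEQI NARGEGGAMR) and the end (DAS) of the protein sequence shown above.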