Gene TM1040_1818 details


Gene Information

Locus tag: TM1040_1818
Symbol:
ID: 4076964
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1912042
End bp: 1913139
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 61%
IMG OID: 638007133
Product: hypothetical protein
Protein accession: YP_613813
Protein GI: 99081659
COG category:
COG ID:
TIGRFAM ID:
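
The coordinates and lengths in this record are related by simple arithmetic, which makes a quick consistency check possible. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes a single trailing stop codon. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1912042, 1913139)
print(length_bp)                  # 1098
print(protein_length(length_bp))  # 365
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed above.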


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0390612
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.325282
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCGA AAGATGCCGA GGAAACCCAC CGTTTTCCGT GCGAGCAATG TGGCGCGGAC 
TATCGTTTTG CCCCGGCGGA GGGCGGGCTT GTCTGTGATC ATTGCGGCCA CAAGAAAGAG
CTGATCGAAA GCCCATGGGG CGGTGGGGCG CTGAAGGAAC TCGACTTCCT GCAAGCGCTG
CGTGAACAGC TTCCCGCTGC CGAAATGGAA GTCACGCGCG TCTCCTCCTG CCCCAACTGC
GCCGCCCAGG TCGAATTCGA CCCGGCGGTG CATGCGCTGG AATGTCCCTT CTGCGCGACC
CCCGTGGTGG CCGACACCGG CGAGAATCGC CACATCAAAC CCAAGGGCGT GCTGCCCTTT
CAGTTTGACG AACGCGCCGC ACATCAGGCG ATGGAAGACT GGCTCGGCAA GCTGTGGTTC
GCGCCCAACG GGCTAAAGGA ATACGCCCGC AAAGGGCGCA AGATGGATGG GATCTATGTC
CCCTACTGGA CCTATGACGC CCAGACCCAC AGCCAATACA CCGGCCAGCG CGGCACCGAA
TATACCGTAA GCCGTACCGT CATGGTGGAT GGCAAACCCC AGGTGCGCAG CGAAATCCGC
GTGCGCTGGT CCAATGTGCG TGGACGGGTG CAGCGATTCT TTGACGATGT GCTGGTGCTC
GCCTCCAAAA GCCTGCCGCG CAAATACACC GAAGGCCTCG AGCCCTGGGA CCTGTCGCAG
CTTGCGCCTT ATCAGCCCAA GTATCTCGCA GGGTTTCGCG CGGAGGCCTA TACCATCGAC
CTTGAAGCCG GCTTTGCCGA TGCGCGGCAA AAAATGGACC GCATCATCGA GCGTGACATC
AAATTCGACA TTGGCGGCGA CCGCCAGCGG ATCAGTTCGG TCGATACGGA CGTCAGCGCG
GTCACATTCA AACATGTGCT GCTGCCGGTG TGGATGGCCG CTTATAAATA TCGCGGGCAG
AGCTATCGCT TTGTGGTCAA CGGGCAGTCG GGGCGCGTGC AGGGCGAACG CCCCTTTTCC
GCCTGGAAGA TCGCAGGCGC GGTTGTCGTC GGCCTGATCC TGGCGGCAGG CGTGGCCTAC
CTGGCGTCAC AGAGTTAA
 
Protein sequence
MTPKDAEETH RFPCEQCGAD YRFAPAEGGL VCDHCGHKKE LIESPWGGGA LKELDFLQAL 
REQLPAAEME VTRVSSCPNC AAQVEFDPAV HALECPFCAT PVVADTGENR HIKPKGVLPF
QFDERAAHQA MEDWLGKLWF APNGLKEYAR KGRKMDGIYV PYWTYDAQTH SQYTGQRGTE
YTVSRTVMVD GKPQVRSEIR VRWSNVRGRV QRFFDDVLVL ASKSLPRKYT EGLEPWDLSQ
LAPYQPKYLA GFRAEAYTID LEAGFADARQ KMDRIIERDI KFDIGGDRQR ISSVDTDVSA
VTFKHVLLPV WMAAYKYRGQ SYRFVVNGQS GRVQGERPFS AWKIAGAVVV GLILAAGVAY
LASQS
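
Both sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of how a block-formatted nucleotide sequence could be joined and its GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the record's own GC content field was derived (for example, its rounding) is not stated in the source.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence shown above, used as example input.
first_line = "ATGACCCCGA AAGATGCCGA GGAAACCCAC CGTTTTCCGT GCGAGCAATG TGGCGCGGAC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this first line only
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_percent` gives a value that can be compared against the GC content field in the Gene Information section.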