Gene TM1040_1057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1057 
Symbol 
ID4077197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1136156 
End bp1137433 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content68% 
IMG OID638006361 
Producthypothetical protein 
Protein accessionYP_613052 
Protein GI99080898 
COG category[S] Function unknown 
COG ID[COG5323] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0533808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGATC ACACATTGGC CGCGCTGCCC TATATGTTTG ATCTCTGGGC GCTCCCGCAT 
CAGTGCCCCC CGCTGGGCGA CTGGCGCGCC TGGGTGATCC TTGGCGGGCG CGGGGCGGGG
AAGACGCGCG CCGGTGCGGA ATGGGTGCGC AGCCAGGTCG AGGGGGCAGG GCCCTTTGGC
GTCGGGTCTG CGCGTCGCGT GGCGCTGGTG GGGGAGACCT ATGATCAGGT ACGCGACGTG
ATGATCCACG GTGACAGCGG TATCCTTGCC TGTTCGCCAC CGGATCGACG CCCGGAGTGG
CGTGCGGGCG AACGCCGACT GCTCTGGCCC AACGGGGCAA GCGCGCAGGC GTTCTCGGCC
TCTGATCCAG AGGTGCTGCG TGGGCCGCAG TTCGATGCGG CCTGGGTTGA CGAGCTGGCC
AAGTGGCGCC GCGCACAGGA GGCCTGGGAC ATGTTGCAGT TTGCCCTGCG GCTTGGGACA
GCGCCGCGCG TCTGCGTCAC CACCACACCG CGCAATGTAC CCCTTCTGAA GGGGCTGCTT
CAAAGCCCCT CGACCGTCAC CACCCATGCC CCCACCGAGG CCAACAGCGC AAATCTTGCA
CCAAGTTTCC TCAGTGAAGT GCGGGCGCGC TATGCGGGCT CCCGACTGGC GCGGCAGGAG
CTTGATGGTG TCTTGCTTGC GGATGTGGAC GGCGCGCTCT GGAGCTCTGA CATGCTGGCA
GAGATTCAGC GGCGCGACAC CCCGCGTCTT GATCGCATCG TGGTGGCCGT CGACCCCTCG
GTGAGCGCGC ACAAGGGCTC TGATGCCTGC GGGATCATCG TTGCAGGCGC GCAGACACAG
GGGCCGATCT CGTCGTGGCG GGCCTATGTG CTGGCCGATC ATACGGTTCA GGGGCTTGGC
CCCACCGGCT GGGCACGGGC GGCGATTGCG GCGCGCGATG CCTACAAAGC GGACCGCCTG
GTGGCAGAGG TCAACCAAGG CGGCGCGCTG GTGGGCACGG TGTTGCGCCA GGTGGATCCC
TTGGTGCCTT TCACCCCGGT CCATGCCAGC AAAGGCAAGG CGGCGCGGGC GGAGCCCGTC
GCGGCGCTCT ATGAGCAGGG GCGCGTGCAT CATGCGCCGG GCCTGCAAGA GCTCGAAGAG
CAGATGTGCC TGATGACCGC GCAGGGCTAT CGCGGGGATG CATCGCCGGA TCGTGTGGAT
GCGCTGGTCT GGGCGCTGCA TGCGCTGATT ATCGGCCCGG CAGAGCAGCA TCGCTGCCCC
AAGATCCGCC GGCTCTGA
 
Protein sequence
MDDHTLAALP YMFDLWALPH QCPPLGDWRA WVILGGRGAG KTRAGAEWVR SQVEGAGPFG 
VGSARRVALV GETYDQVRDV MIHGDSGILA CSPPDRRPEW RAGERRLLWP NGASAQAFSA
SDPEVLRGPQ FDAAWVDELA KWRRAQEAWD MLQFALRLGT APRVCVTTTP RNVPLLKGLL
QSPSTVTTHA PTEANSANLA PSFLSEVRAR YAGSRLARQE LDGVLLADVD GALWSSDMLA
EIQRRDTPRL DRIVVAVDPS VSAHKGSDAC GIIVAGAQTQ GPISSWRAYV LADHTVQGLG
PTGWARAAIA ARDAYKADRL VAEVNQGGAL VGTVLRQVDP LVPFTPVHAS KGKAARAEPV
AALYEQGRVH HAPGLQELEE QMCLMTAQGY RGDASPDRVD ALVWALHALI IGPAEQHRCP
KIRRL