Gene TM1040_3800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3800 
Symbol 
ID4074951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008042 
Strand
Start bp53072 
End bp54244 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content56% 
IMG OID638004459 
Producthypothetical protein 
Protein accessionYP_611194 
Protein GI99077935 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value0.945188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTCAGT TCACGGAAAC AACGCAAGTT GGCCTGTTTG CCCGGCTGAA GAACGCCATC 
AAAGGCATCG GGCTTGGAAT CAGCTTTATC GGTATCGCTG TGTATTTTCT GTTTTGGAAC
GAAGGCAATG CCGTGCGTAC AGCTCGGGCG CTTGCCGAAG GGGCCAACCA GGTCGTGTCG
GTCGACCACA CGGCAATAGA TCCGCAAAAC GAAGATCGCC TTCTGCATAT AGGCGGCCCG
CTTGCGCTTG AGGTGCCGCT GGCGGATGTG GCGCTGGGGG TGGTTGCGTC TGCACAAACC
GTGCGTCTTG AACGCAAGGT AGAGCAATTT GCCTGGATCG AAGACAAGCA AACCAAGACG
GAGACAAAAC TAGGTGGCGG GCAGGAAAAG ACCACGCGAT ACACCTATCG TCAGGGTTGG
ACGGATGCGC CAGCCAGTGG AGCAGAGTTT CGGGTCCCCG AGGGGCATAT GAACCCGCCG
ATGCCAATCG CATCGAAAGT GATCCGACAG CCAGAGGGCA CAATTGGTGC TTTTACCGTA
GATGATGAGA TTTCAGATCT GGGCGGTTCA ACACCAATGC TGTTGGACTC ACAGCAGGCG
GAGGATGTCG CGCGGGCTCT TTCGTTGCAG CAACCGGCGA AACTGGTCGC TGGGCAGGTG
GTTTTTGGTG CAGATGTCAC GGCTCCGCAG CTGGGTGATA TCCGGGTGAG CTATCGCGTA
TCTGAAATCG AAGAGGCGAG CGTGGTCGGT GTACAGCGCA GCGACACGTT GTTGCCCTAC
ACTGCCCAAA ACGGGCGCAA GATCTACTTG GTGGCAGAAG GCTTGAAGAC TGCGGACGAG
ATGTTCCAGA CAGCTGTTTC CAACAACACC TTCAAAACTT GGATGTTGCG CATCGGTCTC
TTGGTCCTGC TGTTTTTGGG ATTTAAGGCG CTGTTCGGCG TCGTAGACGT AATTGCCAGC
ATTCTGCCGT TTCTGGGATG GATCACGGCT TCTGTCACCT CCTTGATCAG CGTTGCTCTT
ACGCTGGTTG TCGGTGGCAC CACGATAGCG ATTGCTTGGG TCTATTTCCG CCCAGTTCTG
GCGCTCCTTA TCATTGCTGT CGCTTTGGCC GGAGCAGCCG CCAGCGCCTA TTGGCTGCGG
AAAGCGGCGC CCGAGACACC TAAGACACCC TGA
 
Protein sequence
MSQFTETTQV GLFARLKNAI KGIGLGISFI GIAVYFLFWN EGNAVRTARA LAEGANQVVS 
VDHTAIDPQN EDRLLHIGGP LALEVPLADV ALGVVASAQT VRLERKVEQF AWIEDKQTKT
ETKLGGGQEK TTRYTYRQGW TDAPASGAEF RVPEGHMNPP MPIASKVIRQ PEGTIGAFTV
DDEISDLGGS TPMLLDSQQA EDVARALSLQ QPAKLVAGQV VFGADVTAPQ LGDIRVSYRV
SEIEEASVVG VQRSDTLLPY TAQNGRKIYL VAEGLKTADE MFQTAVSNNT FKTWMLRIGL
LVLLFLGFKA LFGVVDVIAS ILPFLGWITA SVTSLISVAL TLVVGGTTIA IAWVYFRPVL
ALLIIAVALA GAAASAYWLR KAAPETPKTP