Gene TM1040_3478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3478 
Symbol 
ID4075112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp502467 
End bp503726 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content60% 
IMG OID638004987 
Producthypothetical protein 
Protein accessionYP_611712 
Protein GI99078454 
COG category[S] Function unknown 
COG ID[COG3748] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.300585 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGAGC TTGTCATCAT GTGGGACTGG CTGGGGTTTG CCGTCCGCTG GCTACATGTC 
ATCACCGCGA TCGCCTGGAT CGGGTCGTCC TTTTATTTCG TGGCGCTGGA TCTGGGGCTG
CGCAAGGTGC CGCATCTTCC GGTGGGCGCG CATGGTGAAG AGTGGCAGGT GCACGGCGGT
GGCTTTTACC ACATCCAGAA ATATCTGGTC GCCCCGGCCA ATATGCCCGA CCACCTGATC
TGGTTCAAAT GGGAGAGCTA CGCCACCTGG CTCTCGGGGG CTGCGCTCCT GATGATCGTC
TACTGGGCGG GCGGCGAGCT CTATCTCATT GACGCAAACA AGGCCGACCT GGCGCTGTGG
CAGGGGATCC TGATTTCCGG CGCATCGCTG AGCGTGGGCT GGCTGGTCTA TGACTTTTTG
TGCAAATCCC CGCTTGGCGA AAAGCCGACC ATGCTGATGG TGCTGTTGTT CGTGCTGCTG
GTAGCCATGG GTTATGGTTA CAACCAGATC TTTACCGGCC GGGCGGTGAT GCTGCATCTG
GGGGCCTTCA CGGCGACCAT CATGACGGCG AATGTGTTCT TTATCATCAT GCCGAACCAG
CGCATCGTGG TAAAAGACCT GCAAGAAGGG CGCACGCCGG ATGCAAAATA CGGCAAGATC
GCCAAGCTGC GCTCGACGCA CAACAACTAC CTGACCTTGC CGGTGATCTT CCTGATGCTC
TCCAACCACT ATCCGCTGGC CTTTGCCACC GAGTACAACT GGCTGATTGC GGCGCTTGTG
TTCCTGATGG GGGTGACGAT CCGTCACTAC TTCAACACGC GGCACGCGGG CGCGGGCAAT
CCGACCTGGA CCTGGCCCGT GACGATCCTG CTGTTCATTG CCATCATGGC ATTGAGCCAA
GCGCCGCTGG TGCAGGACAC CTATGAGGAG TCCGAGGCGC GTACACTGAC CCAAACCGAG
CTGGCTTTTG CTGGCTCGGA GCATTTCGAG GACGTGATGA ATGTGGTGCC GGGACGTTGC
GCCATGTGCC ACGCGCGCGA GCCTTATTAC GACGGCATCC GCCGCGCGCC CAAGAATGTG
TTGCTCGAAA CACCGGCAGA TGTCGCCAAA TACGCCCGGC AGATCTATCT TCAGGCCGGG
GCGACCCATG CAATGCCGCC TGCAAATGTG ACCTATATGG AAGAGGACGA GCGCGCGCTC
ATCCGTCAAT GGTATGCAGA TGGCGCGGCA GAGCTGCCCT TCTATCTGGC GGGCAACTGA
 
Protein sequence
MQELVIMWDW LGFAVRWLHV ITAIAWIGSS FYFVALDLGL RKVPHLPVGA HGEEWQVHGG 
GFYHIQKYLV APANMPDHLI WFKWESYATW LSGAALLMIV YWAGGELYLI DANKADLALW
QGILISGASL SVGWLVYDFL CKSPLGEKPT MLMVLLFVLL VAMGYGYNQI FTGRAVMLHL
GAFTATIMTA NVFFIIMPNQ RIVVKDLQEG RTPDAKYGKI AKLRSTHNNY LTLPVIFLML
SNHYPLAFAT EYNWLIAALV FLMGVTIRHY FNTRHAGAGN PTWTWPVTIL LFIAIMALSQ
APLVQDTYEE SEARTLTQTE LAFAGSEHFE DVMNVVPGRC AMCHAREPYY DGIRRAPKNV
LLETPADVAK YARQIYLQAG ATHAMPPANV TYMEEDERAL IRQWYADGAA ELPFYLAGN