Gene TM1040_3685 details

Gene Information

Locus tag: TM1040_3685
Symbol:
ID: 4075654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 743586
End bp: 744743
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 59%
IMG OID: 638005205
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_611914
Protein GI: 99078656
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
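
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(743586, 744743)
print(length)                  # 1158
print(protein_length(length))  # 385
```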


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.295297
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAAA AACACCCCGA CGGCCGTTTT GTGGATGATG CGCCCTATGA CCCGCAGGCC 
TCGATCCGCG AACTGGAGCG CAAGGATCTC GACGCGCCGA ACTGGGTGCT GATCTGGCGC
AAGTTCAAGA CCCACCGCCT GGGGTTGATC TCGGGCATAT TCCTGCTGTT TTGCTATGTG
ATGCTGCCGT TTGTAGGCTT CATCGCGCCC TATGGCCCCA ACGAGCGCAA CGCCGAGCAT
CTCTATTCGC CACCGCAGTC CGTGAACTTC TTTCACGAGG GCGAATTCCT GGGCCCGTTT
ATCTATCCGC TGACTTCAGA GGCCGATCTT GAGACATTTC AATGGGTGGT GAAGCCTGAT
TACGACAACC CGCAACAGAT CCGTTTCTTT TGCGAGGGGG CCGAATATCG ACTGGCGGGT
CTGATCCCTG CCAACACCCA TCTCTTTTGC GCGCCCGAAG GGGCGACATT GTTTCTCTGG
GGGTCAGATC GCCTGGGACG TGACATCTTC AGCCGCATTC TCTTTGGCGC GCAGCTCTCG
CTCACCGTGG GTCTGATCGG CATCACGGTA TCTTTTGTTC TCGGGATCTT TTTTGGCTCT
GTTGCGGGGT ATTTCGGCGG CAAGATCGAC TGGGTCATCA ACCGCGCCAT CGAAATCCTG
CGCAGCCTGC CCGAGCTGCC GCTCTGGCTG GCGCTTTCAG CAGCGGTCCC CTCCACATGG
TCGCCGGTGG CAGTGTTTTT CATCATTTCC ATCATCCTCG GCATCCTCGA CTGGCCGGGT
CTTGCGCGTT CCGTCAGGGC CAAGTTCCTG AGCCTGCGCG AAGAGGAATA CGTCCGCGCC
GCCGAAATGA TGGGCGCATC TTCAGGGCGC GTGATCAAGA AACACCTGCT GCCCAACTTC
ATGAGCCATC TTATCGCCTC GGCCACATTG TCGATCCCGG CAATGATCTT GGGAGAGACG
GCGCTCTCGT TCCTTGGGCT CGGTCTGCGC GCCCCGGCAG TGAGCTGGGG GGTGATGCTC
AATGACGCGC AGAACCTTGC CAATATCGAG ATCTACCCCT GGACCGCGAT CCCAATGCTG
CCGATCATCG TGGTCGTTCT GGCGTTCAAC TTTCTGGGCG ACGGTCTGCG CGATAGTCTG
GATCCCTATC AGCAATGA
 
Protein sequence
MSEKHPDGRF VDDAPYDPQA SIRELERKDL DAPNWVLIWR KFKTHRLGLI SGIFLLFCYV 
MLPFVGFIAP YGPNERNAEH LYSPPQSVNF FHEGEFLGPF IYPLTSEADL ETFQWVVKPD
YDNPQQIRFF CEGAEYRLAG LIPANTHLFC APEGATLFLW GSDRLGRDIF SRILFGAQLS
LTVGLIGITV SFVLGIFFGS VAGYFGGKID WVINRAIEIL RSLPELPLWL ALSAAVPSTW
SPVAVFFIIS IILGILDWPG LARSVRAKFL SLREEEYVRA AEMMGASSGR VIKKHLLPNF
MSHLIASATL SIPAMILGET ALSFLGLGLR APAVSWGVML NDAQNLANIE IYPWTAIPML
PIIVVVLAFN FLGDGLRDSL DPYQQ
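
Both sequences above are displayed in blocks of ten letters. The sketch below shows one way to strip that spacing and translate the gene sequence into protein. It relies on the fact that translation table 11 uses the same sense and stop codon assignments as the standard code, and it assumes the input contains only A, C, G, and T. The helper names `clean` and `translate` are hypothetical.

```python
# Illustrative helpers; names are hypothetical, not from the source database.
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
# Translation table 11 shares these assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks, and uppercase the letters."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGAGCGAAA AACACCCCGA CGGCCGTTTT"))  # MSEKHPDGRF
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is a quick consistency check on the record.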