Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3685 |
Symbol | |
ID | 4075654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 743586 |
End bp | 744743 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638005205 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_611914 |
Protein GI | 99078656 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.295297 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAA AACACCCCGA CGGCCGTTTT GTGGATGATG CGCCCTATGA CCCGCAGGCC TCGATCCGCG AACTGGAGCG CAAGGATCTC GACGCGCCGA ACTGGGTGCT GATCTGGCGC AAGTTCAAGA CCCACCGCCT GGGGTTGATC TCGGGCATAT TCCTGCTGTT TTGCTATGTG ATGCTGCCGT TTGTAGGCTT CATCGCGCCC TATGGCCCCA ACGAGCGCAA CGCCGAGCAT CTCTATTCGC CACCGCAGTC CGTGAACTTC TTTCACGAGG GCGAATTCCT GGGCCCGTTT ATCTATCCGC TGACTTCAGA GGCCGATCTT GAGACATTTC AATGGGTGGT GAAGCCTGAT TACGACAACC CGCAACAGAT CCGTTTCTTT TGCGAGGGGG CCGAATATCG ACTGGCGGGT CTGATCCCTG CCAACACCCA TCTCTTTTGC GCGCCCGAAG GGGCGACATT GTTTCTCTGG GGGTCAGATC GCCTGGGACG TGACATCTTC AGCCGCATTC TCTTTGGCGC GCAGCTCTCG CTCACCGTGG GTCTGATCGG CATCACGGTA TCTTTTGTTC TCGGGATCTT TTTTGGCTCT GTTGCGGGGT ATTTCGGCGG CAAGATCGAC TGGGTCATCA ACCGCGCCAT CGAAATCCTG CGCAGCCTGC CCGAGCTGCC GCTCTGGCTG GCGCTTTCAG CAGCGGTCCC CTCCACATGG TCGCCGGTGG CAGTGTTTTT CATCATTTCC ATCATCCTCG GCATCCTCGA CTGGCCGGGT CTTGCGCGTT CCGTCAGGGC CAAGTTCCTG AGCCTGCGCG AAGAGGAATA CGTCCGCGCC GCCGAAATGA TGGGCGCATC TTCAGGGCGC GTGATCAAGA AACACCTGCT GCCCAACTTC ATGAGCCATC TTATCGCCTC GGCCACATTG TCGATCCCGG CAATGATCTT GGGAGAGACG GCGCTCTCGT TCCTTGGGCT CGGTCTGCGC GCCCCGGCAG TGAGCTGGGG GGTGATGCTC AATGACGCGC AGAACCTTGC CAATATCGAG ATCTACCCCT GGACCGCGAT CCCAATGCTG CCGATCATCG TGGTCGTTCT GGCGTTCAAC TTTCTGGGCG ACGGTCTGCG CGATAGTCTG GATCCCTATC AGCAATGA
|
Protein sequence | MSEKHPDGRF VDDAPYDPQA SIRELERKDL DAPNWVLIWR KFKTHRLGLI SGIFLLFCYV MLPFVGFIAP YGPNERNAEH LYSPPQSVNF FHEGEFLGPF IYPLTSEADL ETFQWVVKPD YDNPQQIRFF CEGAEYRLAG LIPANTHLFC APEGATLFLW GSDRLGRDIF SRILFGAQLS LTVGLIGITV SFVLGIFFGS VAGYFGGKID WVINRAIEIL RSLPELPLWL ALSAAVPSTW SPVAVFFIIS IILGILDWPG LARSVRAKFL SLREEEYVRA AEMMGASSGR VIKKHLLPNF MSHLIASATL SIPAMILGET ALSFLGLGLR APAVSWGVML NDAQNLANIE IYPWTAIPML PIIVVVLAFN FLGDGLRDSL DPYQQ
|
| |