Gene Information |
Locus tag | TM1040_2684 |
Symbol | |
ID | 4077595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2822202 |
End bp | 2824085 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638008009 |
Product | oligopeptide/dipeptide ABC transporter, ATP-binding protein-like |
Protein accession | YP_614678 |
Protein GI | 99082524 |
COG category | [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component; [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
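
The coordinate and length fields above can be cross-checked against each other with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon (both assumptions agree with the listed values). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record for locus tag TM1040_2684.
length_bp = gene_length(2822202, 2824085)
print(length_bp)                  # 1884
print(protein_length(length_bp))  # 627
```

Both results match the Gene Length (1884 bp) and Protein Length (627 aa) fields. The strand does not affect this check, because the length of a feature is the same on either strand.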
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00440045 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.764263 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCTTCC TGCGTCTTTT GTCCCGCAAC CGCCTCGCGC TGGCGGGGCT CATCGTGATG TCGGTGGTGC TGCTGCTGGC GGTGCTGACA CCCATTTTGC CGCTGCCCGA CCCGGATGTG ACCAACACTG CAGAGCGGTT CAAGAAACCC TTTAGCGAGG GGGCCTTGCT GGGCACTGAC CACCTTGGTC GGGATCTCGC CAGCCGCCTG ATGTGGGGCA CGCGGCTGTC GCTGGCAGTG GGCTTTGCGG CAGCGGTGGC GGCGGCCACC ATCGGGGCGG CCATCGGCGT GATCGCCGGT TTTTATGGCG GGCGCGTGGA CAATGTGATC ATGCGCGGCG TCGATATGCT GATGGGCTTT CCCTATATCC TCCTGGCGCT GGCGATTGTC GCAGCACTTG GTCCGGGGCT GATGAATGCG CTGATCGCTG TGGCCGCCGT CAACATTCCC TTCTTCGCGC GCAACATTCG CGGTGTCACC GTCGGCATTG CGCACAAGGA ATTTATTGAT GCGGCGCGGC TGTGCGGGAT GTCGAATGCG CGCATTATCA TCACCGAGGT GGTGCCAAAC GTGATCCCGG TGATCGTGAT CGCCATGTCC ACCACTGTCG GCTGGATGAT CCTCGAGACG GCAGGTCTCA GTTTCCTTGG CCTTGGTTCG CAACCGCCGC AGGCGGATCT GGGCTCCATG CTGGGGGAGG CACGCTCGGC GCTGATTACC AATCCGCATA CCTCCGTGGT GCCCGGTGCG ATGATCCTCG TGATCGTGAT GGCGATCAAC CTTCTGGGCG ACGGCGTGCG CGACGCGCTT GATCCGCGCC TGAAATCCGG CGCGCTCAGC CGCCCGATGC CGACCACCAT GGTGCGCCGC ACAGACCCCG TACCGCAGCC CGAAGGCGAC GGTATCCTGA GCCTTTGCAA CCTGCAAACC CAGTTCCACA TCAAGGATCG CATCTACAAG GCCGTGGGCG GCGTGGATCT CTCGGTAAGG CCGGGCGAAT GCCTTGGGAT CATTGGTGAA AGTGGCTCTG GTAAATCCGT GACGGCGCTG TCGATCATGG GGCTGGTGGC CTCGCCCCCC GGTGTCATCA CCGGCGGTGC AGTGCATTAC AAGGGCGAGG ATCTGATCGG TGCGCCCTAT GAGACCCTGC GCCGTCTGCG CGGTGACCGC GTGGCCTATA TCTTTCAGGA TCCTCTGGCG ACGCTGCACC CGCTCTATAC GGTTGGCGCG CAGCTCATCG AGGCGATCCA GAGCCATCAT CGCACCAGCA CCTCCGAGGC GCGTGCCCGC GCGATTGAGC TTTTGAAATC CGTGCGCATC CCCAATGCCG AGGCGCGCGT GGACAATTAC CCGCATGAGA TGTCGGGCGG CATGCGCCAG CGGGTCGGCA TCGCCATGGC GCTGGCCAAC GACCCTGAGG TCATCATCGC GGATGAGCCC ACAACCGCGC TGGATGTGAC GGTGCAGGCG CAGATCCTTG CGCTCCTGGA TGACTTGCGG CGCGAGCGGG GCTTGGCGAT CATCTTCATC ACCCATGATT TTGGCGTGGT GGCGCAGCTC TGTGATCGGG TGGCGGTGAT GTATGCGGGC CGCATTGTCG AGGAAGGCCC CACCGATGCC ATTCTCAATG CGCCTGCTCA CCCCTATACC GCGCGGCTGA TGGCCTGCGT GCCGGAACTG GGCCAGGGCA GGCGCGAGCT TGCGGCCATT CCCGGTCTAC CGCCTGTGGT GGACAAACTG CCCGCGGGGT GCGCCTTTGC GGATCGCTGC CCCAAGGCCG CCAAAGCCTG CCGCAGCGGG 
GACATTGCGC TCGATGGGTT TGGTGCGGGG CGCAAGATCC GCTGTATAGA CCCTGAAATT CCAGTGCAGG AGGCCACGGC ATGA
|
Protein sequence | MSFLRLLSRN RLALAGLIVM SVVLLLAVLT PILPLPDPDV TNTAERFKKP FSEGALLGTD HLGRDLASRL MWGTRLSLAV GFAAAVAAAT IGAAIGVIAG FYGGRVDNVI MRGVDMLMGF PYILLALAIV AALGPGLMNA LIAVAAVNIP FFARNIRGVT VGIAHKEFID AARLCGMSNA RIIITEVVPN VIPVIVIAMS TTVGWMILET AGLSFLGLGS QPPQADLGSM LGEARSALIT NPHTSVVPGA MILVIVMAIN LLGDGVRDAL DPRLKSGALS RPMPTTMVRR TDPVPQPEGD GILSLCNLQT QFHIKDRIYK AVGGVDLSVR PGECLGIIGE SGSGKSVTAL SIMGLVASPP GVITGGAVHY KGEDLIGAPY ETLRRLRGDR VAYIFQDPLA TLHPLYTVGA QLIEAIQSHH RTSTSEARAR AIELLKSVRI PNAEARVDNY PHEMSGGMRQ RVGIAMALAN DPEVIIADEP TTALDVTVQA QILALLDDLR RERGLAIIFI THDFGVVAQL CDRVAVMYAG RIVEEGPTDA ILNAPAHPYT ARLMACVPEL GQGRRELAAI PGLPPVVDKL PAGCAFADRC PKAAKACRSG DIALDGFGAG RKIRCIDPEI PVQEATA