Gene Information |
Locus tag | TM1040_0422 |
Symbol | |
ID | 4076182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 432884 |
End bp | 434839 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638005717 |
Product | BCCT transporter |
Protein accession | YP_612417 |
Protein GI | 99080263 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1292] Choline-glycine betaine transporter |
TIGRFAM ID | [TIGR00842] choline/carnitine/betaine transport |
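The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the source record. It assumes that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 432884
end_bp = 434839
gene_length_bp = 1956
protein_length_aa = 651

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions the three fields agree with one another, which is a quick way to spot off-by-one errors when coordinates are copied between tools.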
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0280485 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.480367 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGTTT TTTGTGCAAA GTTCGCGTAC GGGCGCGATG CTGCGGCGCG GCGTGATTGG ATCTGCGGCT TTGCGACGTG GCTCTCAGCT TCGCGACATC CCCCCGAATT TCAGTCCCAG CGGATGCCGC ATCGACGAGT TTTCCTGTCG GATCGCAGTT GGCGCAGGTG GCGCCAAATC GAGACGATCG CGCAATCCAC CTGCAGGCTT TTGCTGCGGT GGCAACCAAG ATTGCGCCCG GCACATTCGC CCGAGGTCGC CTTTCCCCAA CAGAGAGGAC CAACCACCAT GCCGCTTAAA CCGCCTTTGA CGGAGCTTGA GATCAAGACT GCCGACAGCG GATTCTACAA AGGGTTTACC AAGGATGTGA CAATCACTGC CAAGATCCTT GTTGCCGCCC TGATCCTCTG GGCCGTGGCC TTTCCAAATC AGGCGGCCAG CGTGCTGTCG CACCTAAACG GGGTCATTCT GGCCAGCTTC AACTACTGGT ACATCTACGT GGTCGCCTTT TTTGTGGTGG TGAGCCTGGG GCTTGCGATC TGGCCTGCAT CCGGACGTCT GCGGCTCGGA GCGGAGCATG AAAAGCCAGA GTTCTCGGCC TTCTCCTGGT TCTCCATGAT GTTCGGCGCT GGCATGGGAA TTGGTGTTCT GACCTATGCC ACCGGTGAGC CGATCTATCA CTTCCAGAAC AACCCGGATG TCATCCAGGG TTTTGTCGAA GGGGCCAGCG CCGAGACGGT GCGCTCTGCC TACAAGTGGT CCTTTTTGCA TTGGGGTCTG TCTCCATGGG GGGCCTATGC GTTGACCGGG CTTGCGCTCG CGTTCTTCGC TTACCGCCGC GGGCTCCCGC TGACCATTCG GTCGGCGCTG ACGCCTCTGT TCGGCGATCG GCTGTCGGGT TTTGCGGGTC ATGCCGTCGA TGTGGTGGCC GTCATCGCGA CCGTGCTGGG GGTGGCGCAA ACGCTCGGCT TTGGGGTCGA GCAATTCGTC GCGGGGCTGA GCCGCATCGG CTTTGGCGAC TGGCTTCTTG TCACAACCGA TGAAGGAACC CAAAAGGCGT CTGTGGTGGC GATCGTGCTG TCGCTGGTGA TCATCATGGG GGCTTCGACG CTGTCGGCCC TGTCGGGTGT CGGCAAGGGG ATCAAATGGC TCTCCAACAT CAACATGGGG TTGAGCGTGT TCTTGCTGTC GTTTTTCCTG ATCTTTGGCT CCACGGTTTT TGGCCTCACG GCGCTGGTGA CAGGGATCTA TGACTATCTC CTGGCTTTTC CCGCGATGCT GTTCACCGTC TGGAGCGACA AAGGCACCGA GACCAGCAGC GCACTTGAGA GCTGGCAGGG TGGTTGGACC ATTTTCTACT GGGCTTGGTG GATTGCGTTT GCTCCCTTCG TCGGTTTGTT CCTCGCCCGG ATCTCTCGAG GTCGGACGAT CCGCGAATAT GTCTTTGGCG CGATGCTGGT ACCATCCGTC ATGTGTTTTG TCTGGTTCGC CCTGATTGGC GGCACCGCCA TCGATCTGGA ACTGTCGGGC ATCGCAGAGG GTGCAATCCT TGGAACGGGG CTGTCAGACC AGCTCTATGC AACCCTCGCA GTTCTCCTCA GCGATGGTCT GGCCTGGGTG TTCTCGGTGG TCGTCGCGGT TCTGTTGATG ACCTACCTTG TCACCTCGGC GGATTCGGCG GTGTTGATCA TCAACACCAT CAACGCCGCA GGTGACGAGG GACCAAAGGC ACGTCCTCAT ATCATCTTCT GGGGCGTCGC CTTGGCGTTG GTTGTCGGAG CTTTGCTGAT CATTGGTGGT 
CTCGGTGCGA TCCAGACCGC GATGATTATC GGCGCGCTGC CCTTCTCGGT GGTGATGGCG TTGATGGGCG TTGCCTTGAT CAAGGCCATC ATCCTCGACG GGCTTCGGGG CCGCGCGGGA CTGCCAGTTA CGGCAGATCA GCTCGCGGGC GAGTAG
|
Protein sequence | MPVFCAKFAY GRDAAARRDW ICGFATWLSA SRHPPEFQSQ RMPHRRVFLS DRSWRRWRQI ETIAQSTCRL LLRWQPRLRP AHSPEVAFPQ QRGPTTMPLK PPLTELEIKT ADSGFYKGFT KDVTITAKIL VAALILWAVA FPNQAASVLS HLNGVILASF NYWYIYVVAF FVVVSLGLAI WPASGRLRLG AEHEKPEFSA FSWFSMMFGA GMGIGVLTYA TGEPIYHFQN NPDVIQGFVE GASAETVRSA YKWSFLHWGL SPWGAYALTG LALAFFAYRR GLPLTIRSAL TPLFGDRLSG FAGHAVDVVA VIATVLGVAQ TLGFGVEQFV AGLSRIGFGD WLLVTTDEGT QKASVVAIVL SLVIIMGAST LSALSGVGKG IKWLSNINMG LSVFLLSFFL IFGSTVFGLT ALVTGIYDYL LAFPAMLFTV WSDKGTETSS ALESWQGGWT IFYWAWWIAF APFVGLFLAR ISRGRTIREY VFGAMLVPSV MCFVWFALIG GTAIDLELSG IAEGAILGTG LSDQLYATLA VLLSDGLAWV FSVVVAVLLM TYLVTSADSA VLIINTINAA GDEGPKARPH IIFWGVALAL VVGALLIIGG LGAIQTAMII GALPFSVVMA LMGVALIKAI ILDGLRGRAG LPVTADQLAG E
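The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any calculation. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence and how the final codon could be inspected. The helper names (`clean_sequence`, `gc_fraction`, `last_codon`) are hypothetical, and the two strings are short excerpts from the start and end of the gene sequence, not the full record.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def last_codon(text: str) -> str:
    """Return the final three bases of the cleaned sequence."""
    return clean_sequence(text)[-3:]


# Excerpts only: the first three and last two blocks of the gene sequence.
head = "ATGCCCGTTT TTTGTGCAAA GTTCGCGTAC"
tail = "GCTCGCGGGC GAGTAG"

print("GC fraction of first 30 bases:", round(gc_fraction(head), 3))
print("ends with a stop codon:", last_codon(tail) in STOP_CODONS)
```

Passing the complete gene sequence to `gc_fraction` would give a value to compare with the GC content field; the excerpt alone is not expected to match the whole-gene figure.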