Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0102 |
Symbol | |
ID | 4447435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 104219 |
End bp | 105274 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639687897 |
Product | bile acid:sodium symporter |
Protein accession | YP_829603 |
Protein GI | 116668670 |
COG category | [R] General function prediction only |
COG ID | [COG0385] Predicted Na+-dependent transporter |
TIGRFAM ID | [TIGR00841] bile acid transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGAGG CAACCAAATC CCACGAACAC CCCAAAGCCG GCGCCGATGC TGATGCCGCA CCGCTGAATC CGGCACTGGC GGCAGAGGCC AAAATCGCGC GAATCGCGGT CACCGTCTTT CCGCTGCTGG TGGTGGCTGC CGGCATTGCC GGTTTCCTGC TGCCGGGCGC CTTCAAGCCG ATGGCCCCCG GCGTCCCGTA CTTGCTGGGC GTCATCATGT TCTGCATGGG CCTGACGCTC ACCCCGCCGG ACTTCGCGTC CGTGGTCAAG CGGCCCTGGG CCGTGGTTCT GGGCATCGTG GCCCACTACG TGATCATGCC GGGCGCCGGC TGGCTGATTG CCGTGGCGCT CAACCTCCCG CCCGAGCTGG CCGTGGGCCT CATTCTGGTG GGCTGCGCGC CGTCCGGGAC CGCCTCCAAT GTGATGGCCT TCCTGGCCAA GGGGGACGTT GCCCTCTCGG TGGCCGTGGC CTCGGTCTCC ACGCTGATCG CCCCGATCGT CACTCCCCTG CTGGTCCTAT TCCTGGCCGG ATCCTTCCTG CAGATCGACG CCGGAGCGAT GGTCGTGGAC ATCGTCAAGA CCGTCCTCCT CCCGGTGATT GCAGGCCTGC TGGCACGGCT GTTCCTCAAG AAGCTCGTCG CGAAGGTGCT TCCGGCACTC CCCTGGGCCT CCGCCGTCGT GATTTCCCTG ATTGTGGCGA TCGTGGTGGC TGGCAGCGCC AGCAAGATCG TGGCCGCCGG CGGCATCGTG TTCCTCGCCG TTGTGCTGCA CAACGGCTTT GGCCTGGGCC TCGGATACCT CGCCGGCAAG CTCGGCAGGC TGGATGACAA GGCCCGCCGC GCGCTGGCCT TTGAAGTCGA AATGCAGAAC TCCGGGCTGG CCGCCACACT GGCCACCGCG CACTTCAGTC CGCTGGCCGC ACTGCCCTCG GCGGTGTTCT CGCTATGGCA CAACATCTCG GGCGCGATTG TGGCCGCATG GCTGGCCCGG CGCCCGCTGA CTGATGCCCC TGGCCGCGAT GCCCAGGTTC ATAGCGCAGC CGCCCGGGAC GCCTGA
|
Protein sequence | MLEATKSHEH PKAGADADAA PLNPALAAEA KIARIAVTVF PLLVVAAGIA GFLLPGAFKP MAPGVPYLLG VIMFCMGLTL TPPDFASVVK RPWAVVLGIV AHYVIMPGAG WLIAVALNLP PELAVGLILV GCAPSGTASN VMAFLAKGDV ALSVAVASVS TLIAPIVTPL LVLFLAGSFL QIDAGAMVVD IVKTVLLPVI AGLLARLFLK KLVAKVLPAL PWASAVVISL IVAIVVAGSA SKIVAAGGIV FLAVVLHNGF GLGLGYLAGK LGRLDDKARR ALAFEVEMQN SGLAATLATA HFSPLAALPS AVFSLWHNIS GAIVAAWLAR RPLTDAPGRD AQVHSAAARD A
|
| |