Gene Arth_0102 details


Gene Information

Locus tag: Arth_0102
Symbol:
ID: 4447435
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 104219
End bp: 105274
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 68%
IMG OID: 639687897
Product: bile acid:sodium symporter
Protein accession: YP_829603
Protein GI: 116668670
COG category: [R] General function prediction only
COG ID: [COG0385] Predicted Na+-dependent transporter
TIGRFAM ID: [TIGR00841] bile acid transporter
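
The coordinate and length fields in this table are related by simple arithmetic, which the sketch below checks. It assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated on the page. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 104219
END_BP = 105274
GENE_LENGTH_BP = 1056
PROTEIN_LENGTH_AA = 351


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 105274 - 104219 + 1 gives 1056 bp, and 1056 / 3 - 1 gives 351 aa, matching the listed Gene Length and Protein Length.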


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTGAGG CAACCAAATC CCACGAACAC CCCAAAGCCG GCGCCGATGC TGATGCCGCA 
CCGCTGAATC CGGCACTGGC GGCAGAGGCC AAAATCGCGC GAATCGCGGT CACCGTCTTT
CCGCTGCTGG TGGTGGCTGC CGGCATTGCC GGTTTCCTGC TGCCGGGCGC CTTCAAGCCG
ATGGCCCCCG GCGTCCCGTA CTTGCTGGGC GTCATCATGT TCTGCATGGG CCTGACGCTC
ACCCCGCCGG ACTTCGCGTC CGTGGTCAAG CGGCCCTGGG CCGTGGTTCT GGGCATCGTG
GCCCACTACG TGATCATGCC GGGCGCCGGC TGGCTGATTG CCGTGGCGCT CAACCTCCCG
CCCGAGCTGG CCGTGGGCCT CATTCTGGTG GGCTGCGCGC CGTCCGGGAC CGCCTCCAAT
GTGATGGCCT TCCTGGCCAA GGGGGACGTT GCCCTCTCGG TGGCCGTGGC CTCGGTCTCC
ACGCTGATCG CCCCGATCGT CACTCCCCTG CTGGTCCTAT TCCTGGCCGG ATCCTTCCTG
CAGATCGACG CCGGAGCGAT GGTCGTGGAC ATCGTCAAGA CCGTCCTCCT CCCGGTGATT
GCAGGCCTGC TGGCACGGCT GTTCCTCAAG AAGCTCGTCG CGAAGGTGCT TCCGGCACTC
CCCTGGGCCT CCGCCGTCGT GATTTCCCTG ATTGTGGCGA TCGTGGTGGC TGGCAGCGCC
AGCAAGATCG TGGCCGCCGG CGGCATCGTG TTCCTCGCCG TTGTGCTGCA CAACGGCTTT
GGCCTGGGCC TCGGATACCT CGCCGGCAAG CTCGGCAGGC TGGATGACAA GGCCCGCCGC
GCGCTGGCCT TTGAAGTCGA AATGCAGAAC TCCGGGCTGG CCGCCACACT GGCCACCGCG
CACTTCAGTC CGCTGGCCGC ACTGCCCTCG GCGGTGTTCT CGCTATGGCA CAACATCTCG
GGCGCGATTG TGGCCGCATG GCTGGCCCGG CGCCCGCTGA CTGATGCCCC TGGCCGCGAT
GCCCAGGTTC ATAGCGCAGC CGCCCGGGAC GCCTGA
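
The record lists a GC content of 68% for this gene. The sketch below shows one way such a figure could be computed from a sequence laid out in space-separated blocks of ten bases, as it is here. How the database itself derives or rounds the value is not stated, and the helper names `clean_sequence` and `gc_content` are hypothetical. Only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay the sequence out in blocks."""
    return "".join(blocked.split()).upper()


def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First line of the gene sequence from this record, used only as sample input.
FIRST_LINE = "ATGCTTGAGG CAACCAAATC CCACGAACAC CCCAAAGCCG GCGCCGATGC TGATGCCGCA"

if __name__ == "__main__":
    fragment = clean_sequence(FIRST_LINE)
    print(len(fragment), "bases in the fragment")
    print(f"GC content of the fragment: {gc_content(fragment):.1f}%")
```

The value for this 60-base fragment will differ from the whole-gene figure; comparing against the listed 68% requires passing all 1056 bases to `gc_content`.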
 
Protein sequence
MLEATKSHEH PKAGADADAA PLNPALAAEA KIARIAVTVF PLLVVAAGIA GFLLPGAFKP 
MAPGVPYLLG VIMFCMGLTL TPPDFASVVK RPWAVVLGIV AHYVIMPGAG WLIAVALNLP
PELAVGLILV GCAPSGTASN VMAFLAKGDV ALSVAVASVS TLIAPIVTPL LVLFLAGSFL
QIDAGAMVVD IVKTVLLPVI AGLLARLFLK KLVAKVLPAL PWASAVVISL IVAIVVAGSA
SKIVAAGGIV FLAVVLHNGF GLGLGYLAGK LGRLDDKARR ALAFEVEMQN SGLAATLATA
HFSPLAALPS AVFSLWHNIS GAIVAAWLAR RPLTDAPGRD AQVHSAAARD A
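
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates that relationship with a plain-Python codon lookup. It relies on the fact that table 11 assigns the same amino acids to the 64 codons as the standard genetic code; it does not handle the alternative start codons that table 11 permits, which is harmless here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated with bases in TCAG order.
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    sequence = "".join(dna.split()).upper()  # drop block spacing and line breaks
    residues = []
    for pos in range(0, len(sequence) - 2, 3):
        amino_acid = CODON_TABLE[sequence[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence from this record.
    print(translate("ATGCTTGAGG CAACCAAATC CCACGAACAC"))
    # Last 9 bases of the gene sequence, ending in the TGA stop codon.
    print(translate("GACGCCTGA"))
```

The first call yields MLEATKSHEH, the opening ten residues of the protein sequence above, and the second yields DA, its final two residues, with the trailing TGA read as the stop.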