Gene Arth_3745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3745 
Symbol 
ID4447795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4220512 
End bp4222686 
Gene Length2175 bp 
Protein Length724 aa 
Translation table11 
GC content66% 
IMG OID639691569 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_833220 
Protein GI116672287 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACCAA TGAGCCACAA CTCCTCAGCG CGCCCCGGCG CAGCGCCGGC CGCTGGCGCA 
CCCTCACCAC CAAACCAGTC ACCGTCGCAG GACGCCGCGC CGGAACGCAC CACGCGGGTC
AACAAGACCG TCTTTTTCGG CTCGGCTGCG GGCGTGGTGG GCATCGCGCT CTGGGCCATG
CTCGCCAAGG ACAACGCCGA GGCCGTCATC GGGTCCATGG TGGGCTGGGT GTCCACCAAC
ATGGGCTGGT ACTACTTCCT GATTGTCACG GCCGTGGTGG TTTTCGTCCT CGTGGCCGCG
CTGTCACGGG TGGGCAAGAC CAAGCTCGGA CCGGACCATT CCAAGCCCCA GTTCGGCATG
TTCACCTGGG CCGCCATGCT GTTCGCCGCC GGCATCGGGA TCGACCTGAT GTTCTTCTCC
GTGTCCGAAC CGGTCAGCCA GTACCTTGCC CCGCCGCAGG GCGAAGGAGG TACCGCTGAG
GCCGCCCGGC AGGCCCTGGT CTGGACCCTG TTCCACTACG GCATCACCGG CTGGGCGCTC
TACGCGCTGA TGGGGTTGGC GCTGGGCTAC TTCGCCTACC GCCACAATCT TCCGCTAAGC
ATCCGCTCCG CGCTGTACCC GATCTTCGGC AAGAAGATCG AGGGCCCGCT GGGGCATGCC
GTGGACATCG CCGCCCTGCT CGGCACCATC TTCGGTATCG CCACTTCGCT GGGCATTGGG
GTGGTCCAGC TCAACTACGG CCTGAACTTC ATGTTCGGCA TCCCGGAGAA CCTCGCCGTC
CAGATCGGCC TGATCGCCCT GTCCGTGGTG ATGGCCACTG TCTCGGTGGT GTCCGGCGTC
GAGAAGGGCA TCCGGCGGCT GTCCGAACTC AACGTGATCC TGGCCGTGGC CCTGATGCTG
TTCGTGCTGG TGACCGGCAA GACCAGCTTC CTGCTGGACG GCATCGTGCA GAACGTCGGC
GACGTCATGA GCCGTTTCCC GGCCATGACC CTGGACACCT TCGCGTACGA CCGGCCCACC
GACTGGCTGA ATGCCTGGAC CCTGTTCTTC TGGGCCTGGT GGATCGCCTG GGCGCCGTTC
GTCGGCCTGT TCCTGGCCCG CATCTCCCGC GGCCGCACCA TCCGCCAGTT CGTGCTCGGC
ACCATGACCG TCCCGTTCCT CTTCATCCTG CTGTGGATTT CCGTGTTCGG GAACTCCAGC
ATCGACCTGA TCATGAACGG CAACGCGGCC TTCGGCGAAG CCGCGATGAG CCACCCGGAG
CGCGGCTTCT ACAGCCTGCT GTCCCAGTAC CCCGGCGTTC CGGTGACGGC CGCCGTCGCC
ACCTTCACGG GACTGCTGTT CTACGTGACC TCCGCCGACT CCGGAGCCCT GGTGATGGCC
AACTTCACCT CGCACCTCAA GGACGCGGAC GCCGACGGTC CGGAATGGAT GCGCGTGTTC
TGGGCGGTGG CAACCGGCCT GCTGACGCTA GCCATGCTGA TGGTGGGCGG CGTGGCCACC
CTGCAGAACG CCACAATTGT CATGGGCCTG CCGCTGTCCC TGCTCCTGGT TTTCATCATG
CTGGGGCTGT ACAAGGCGCT GCGGGTGGAG AACTCGCTCA ACGACAGCTA CCGCGCCAGC
CTGCCGGGCA TCATCACCGG CCGTGCGGGC GAACAGCGCG GCGGCCGCAG CTGGCGCCAG
CGCCTCACCC GGGCCATGAG CTATCCCGGC CGCCGGCAGA CCACGCGGTT CGTCGAAACC
GTGGCCGTGC CGGCGCTCCG GGAGGTCAGC GAGGAACTGA AGGCCCAGGG CGCCGAGACC
GTGCTTTCGG TGTCCACGGT GGAGTCCTGC GGGATCGACA GTGCGGACCT GCAGCTCGCC
ATGGGCGAGG AGCGGCCGTT CAAATACCAG ATCTACCCGG TGCAGTACGA GACGCCGTCC
TACGCCACCC GGCGGGCGGA CCCGGACGAC CACTACTACC GGATGGAAGT GTTCTCCCTC
GAGGGCAGCC ACGGCTACGA CCTCATGGGC TACACGAAGG AACAGGTCAT CACCGATGTC
CTGGACCACT ACGAACAGCA CCTGGAATTC CTGCACCTGA ACCGTGCAGC GCCGGGCAAC
ACCGTCCTGG TCGAAGACCA GGTGGCTAAA GACAATTGGG AATCAGACTT CGAAACGCAG
GAGGCAGCGA AATGA
 
Protein sequence
MGPMSHNSSA RPGAAPAAGA PSPPNQSPSQ DAAPERTTRV NKTVFFGSAA GVVGIALWAM 
LAKDNAEAVI GSMVGWVSTN MGWYYFLIVT AVVVFVLVAA LSRVGKTKLG PDHSKPQFGM
FTWAAMLFAA GIGIDLMFFS VSEPVSQYLA PPQGEGGTAE AARQALVWTL FHYGITGWAL
YALMGLALGY FAYRHNLPLS IRSALYPIFG KKIEGPLGHA VDIAALLGTI FGIATSLGIG
VVQLNYGLNF MFGIPENLAV QIGLIALSVV MATVSVVSGV EKGIRRLSEL NVILAVALML
FVLVTGKTSF LLDGIVQNVG DVMSRFPAMT LDTFAYDRPT DWLNAWTLFF WAWWIAWAPF
VGLFLARISR GRTIRQFVLG TMTVPFLFIL LWISVFGNSS IDLIMNGNAA FGEAAMSHPE
RGFYSLLSQY PGVPVTAAVA TFTGLLFYVT SADSGALVMA NFTSHLKDAD ADGPEWMRVF
WAVATGLLTL AMLMVGGVAT LQNATIVMGL PLSLLLVFIM LGLYKALRVE NSLNDSYRAS
LPGIITGRAG EQRGGRSWRQ RLTRAMSYPG RRQTTRFVET VAVPALREVS EELKAQGAET
VLSVSTVESC GIDSADLQLA MGEERPFKYQ IYPVQYETPS YATRRADPDD HYYRMEVFSL
EGSHGYDLMG YTKEQVITDV LDHYEQHLEF LHLNRAAPGN TVLVEDQVAK DNWESDFETQ
EAAK