Gene Achl_3496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3496 
Symbol 
ID7294977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp3870506 
End bp3872680 
Gene Length2175 bp 
Protein Length724 aa 
Translation table11 
GC content64% 
IMG OID643591902 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_002489541 
Protein GI220914232 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.81429 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA ATGCAGTACC GCCCGTGGAC ACCACGGGTC CTGAGAAAGA AGAACGAAAG 
CCACCCGCGA ACGGCGGAAA TACCAACCAC AGCGCCGGCC CGGCGGCCAA AACCCGGGTC
AACAAGACAG TCTTCCTGGG CTCCGCCACC GGCGTGGTGG CTATCGCCCT GTGGGCCATC
ATGGCCAAGG ACAACGCGGA GGCGGTCATC GGGGCCATGG TGGGCTGGGT GTCCACCAAC
ATGGGGTGGT ACTACTTCAT GATCGTCACC GCCGTGGTCA TCTTCGTCCT GGTGGCCGCC
CTCTCCCGGG TGGGAAAGAC GAAGCTCGGG CCGGACCACT CCAAACCACA GTTCGGCATG
TTCACCTGGG CGGCCATGCT GTTTGCCGCC GGCATAGGCA TCGACCTGAT GTTCTTCTCC
GTGTCTGAAC CGGTGAGCCA GTACCTGGCC CCGCCGCAGG GCGAGGGCGG AACCGCGGAG
GCCGCCCGCC AGGCGCTGGT GTGGACGCTG TTCCACTACG GCATCACAGG CTGGGCGCTG
TACGCCCTCA TGGGGCTCGC GCTGGGCTAC TTCTCCTACC GGCACAACCT GCCGCTGAGC
ATCCGCTCCG CCCTCTACCC CATCTTCGGC AAGAAGATCG AGGGCCCAGT AGGGCACGCG
GTGGACATCG CGGCCCTCCT TGGCACCATC TTCGGGATCG CCACGTCACT GGGCATCGGC
GTCGTCCAGC TCAACTACGG GCTGAACTTC ATGTTCGGCA TTCCGGAGGA CCTGGCCGTG
CAGATCGGCC TGATCGTCCT GTCCGTAGTG ATGGCTACCG TGTCCGTGGT TTCCGGCGTC
GAGAAGGGAA TCCGGCGGCT CTCCGAGCTC AACGTGATCC TGGCCGTCGC CCTGATGCTT
TTCGTCCTGG TCACCGGTAA GACCAGCTTC CTGCTGGACG GGATCGCCCA AAACATCGGT
GACGTTATGA GCCGGTTCCC CGCGATGACC CTGGACACGT TCGCCTATGA CCGGCCCACG
GACTGGATGA ACGCCTGGAC CCTGTTCTTC TGGGCCTGGT GGATCGCCTG GGCACCGTTC
GTCGGCCTGT TCCTGGCCCG CATTTCCCGC GGCCGCACCA TCCGCCAGTT CGTCCTGGGC
ACCATGACCG TGCCGTTCAT CTTCATCGTG CTGTGGATCT CCATCTTTGG AAACTCCAGC
CTCGACCTGA TCATGAACGG CAACGCGGCG TTCGGCGAGG CGGCCATGAG CCACCCGGAA
CGCGGCTTCT ACAGCCTGCT GGAACAGTTC CCCGCCGTGC CCGTCACCGC CGCGGTGGCC
ACGTTCACCG GGTTGCTCTT CTACGTGACC TCGGCCGATT CCGGCGCCCT GGTGATGTCC
AACTTCACCT CGCACCTCAA GGATGCCGAT TCCGACGGCC CCGAGTGGAT GCGCGTCTTC
TGGGCAGTGG CCACCGGCCT GCTGACCCTG GCCATGCTGA CGGTGGGTGG TGTTCCCACG
CTGCAGAACG CGACCATCAT CATGGGACTG CCGCTCTCGC TGCTGCTGGT GCTGATCATG
CTGGGGCTCT ACAAGGCGCT GCGGGTGGAG AACTCGCTCA ACGACAGCTA CCGCGCCAGC
CTGCCTGGCA TCATTACCGG CCGGTCCCTG GACCAGCGGG GCGGACGGAC CTGGCGGCAG
CGGCTCAGCC GCGCCATGAG CTACCCGGGC CGCAAACAGA CAACGCGCTT CGCCGAAACC
GTGGCCCTGC CGGCCCTGAA GGATGTCGAG GCGGAGCTCA AGTCCCAGGG TGCCGAGACG
TCCCTGACCG TAACCACCGT GGAGCCGTGC GGCATCAACA GCATCGACCT GCAGCTGGCC
ATGGGGGAGG AACGGGCGTT CAAGTACCAG ATCTACCCGG TCCAGTATGA GACGCCCAGC
TACGCCACCC GCCGTGCCGA TCCGGAAGAC CGCTACTACC GGATGGAGGT GTTCTCGCAG
GAGGGCAGCC ACGGCTACGA CCTCATGGGG TACACCCGCG AGCAGGTCAT CACCGATGTG
CTGGACCACT ATGAGCAGCA CCTTGAATTC CTGCACCTGA ACCGTGCAGC ACCCGGAAAC
ACCGTCCTGG TCGAGGACCA GGTGGCCAAA GACAATTGGG AATCAGACTT CGACATGCAG
GAGGAAACGA AATGA
 
Protein sequence
MSQNAVPPVD TTGPEKEERK PPANGGNTNH SAGPAAKTRV NKTVFLGSAT GVVAIALWAI 
MAKDNAEAVI GAMVGWVSTN MGWYYFMIVT AVVIFVLVAA LSRVGKTKLG PDHSKPQFGM
FTWAAMLFAA GIGIDLMFFS VSEPVSQYLA PPQGEGGTAE AARQALVWTL FHYGITGWAL
YALMGLALGY FSYRHNLPLS IRSALYPIFG KKIEGPVGHA VDIAALLGTI FGIATSLGIG
VVQLNYGLNF MFGIPEDLAV QIGLIVLSVV MATVSVVSGV EKGIRRLSEL NVILAVALML
FVLVTGKTSF LLDGIAQNIG DVMSRFPAMT LDTFAYDRPT DWMNAWTLFF WAWWIAWAPF
VGLFLARISR GRTIRQFVLG TMTVPFIFIV LWISIFGNSS LDLIMNGNAA FGEAAMSHPE
RGFYSLLEQF PAVPVTAAVA TFTGLLFYVT SADSGALVMS NFTSHLKDAD SDGPEWMRVF
WAVATGLLTL AMLTVGGVPT LQNATIIMGL PLSLLLVLIM LGLYKALRVE NSLNDSYRAS
LPGIITGRSL DQRGGRTWRQ RLSRAMSYPG RKQTTRFAET VALPALKDVE AELKSQGAET
SLTVTTVEPC GINSIDLQLA MGEERAFKYQ IYPVQYETPS YATRRADPED RYYRMEVFSQ
EGSHGYDLMG YTREQVITDV LDHYEQHLEF LHLNRAAPGN TVLVEDQVAK DNWESDFDMQ
EETK