Gene Arth_3716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3716 
Symbol 
ID4443717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4181744 
End bp4183633 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content65% 
IMG OID639691540 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_833191 
Protein GI116672258 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.903721 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCACCC CCCCCGTTGA GCTGGAGGAG CCCAAGCTCG TTCCTGCGTC CGAGAACTCT 
CCTTCCTCCC ACGATTCACA CGATGCCGAG GACACCGAAC AGATCATGCA GGAACTCCGC
CAGACCAAAG CCGAGCAGGC TGTGGCAGAG CGGCGGAACC GCAAACTCAC CCTCGACAAG
GCCACCTTCG GCATCACCGG GGTTCTGGCC CTGGCCTTCG TGGCGTGGGG CTTCCTGGGC
CGGGACAGCC TGGCCGCCAC GTCAACAGCC GCCCTCGACT GGGTAATGGA GTACACCGGC
TGGCTCTTTA TGGTCCTGGC CTCGCTGTTC GTTGTTTTTG TGCTCTGGCT GGCCCTGGGC
AAGTGGGGCA ACATTCCCCT CGGCAAGGAC GGCGAGAAGC CCGAGTTCCG GACCGTGTCA
TGGATTGCCA TGATGTTTGC GGCCGGCATG GGCATCGGCC TGATGTTCTA CGGTGTGGCC
GAGCCGCTGT ACCACTACAT CTCCCCTCCG CCGGGAACGG TGGACGGACG CACGCCGGCA
GCCATCCAGA CGGCCATGGC AACCTCCATC TTCCACTGGA CCCTGCACCC CTGGGCGATG
TACGCCGTCG TCGGCATTGC CATGGCCTAC GGCACCTACC GCCTGGGCCG CCGCCAGCTG
ATCTCGGCAG CGTTCACGTC GCTGTTCGGC ATCAGGACGG TGGAAGGGCC GGTGGGCAAG
TTCATCAACA TCCTGGCGAT CTTCGCCACG CTCTTCGGCA CGGCCGCTTC CCTGGGCCTC
GGCGCCCTCC AGATCGGCAG CGGCCTGACG TCGAACGGCT GGATCGGGGA AGTGGGAACA
CCCATCCTGG TGGCCATCGT TGCCGTCCTG ACAGCCTGCT TCGTGGCCTC GGCGGTGTCC
GGCATCAGCC GCGGCATCCA GTGGCTGTCC AACATCAACA TGGTCCTGGC CGTGATCCTG
GCGCTCATCG TTTTCCTCGC CGGGCCCACA CTGTTTATCC TCAACCTCAT TCCCGCAGCA
GTCGGTGACT ACGCCAGGGA CCTGGCCGAG ATGTCCTCCC GCACCGAAGC CGTGGGCGAC
GAGGCGCTCC GGACCTGGAT GTCCGGCTGG ACCATCTTCT ACTGGGCCTG GTGGGTATCG
TGGACGCCGT TTGTGGGCAT GTTCATCGCC CGCATCAGCC GCGGCCGCAC CATCCGCCAG
TTCGTCACCG GCGTCCTGCT GGTCCCCAGC ATCGTCAGCG TCATCTGGTT CGGTATCTTC
GGCGGCACTG CGTTCCACGT CCAGGAGGAA GCGGACAAGG CGGGTACCCC CGGACTGGTG
TCCATGGCCA GCGGGTCACC GTCCATCGAC TTCGACGGCG CCCTGTTCGA CCTCGTCCGG
AACATGTCCA TGCCTGCCTG GCTCACCGCA GCAGTGGTGG TTCTCGCCAT GGTCCTGGTG
GCCATCTTCT TCATCACCGG AGCCGATTCC GCATCGATCA TCATGGCATC CCTGAGCTCC
AACGGGTCCT CCGACCCGAA GCGCGGCCTG GTCATCTTCT GGGGCCTGCT CACCGGCGCA
GTAGCCGCCG TCATGATGCT GGCCGGCGGC GATGAACCCT CGGAGGCGCT CTCCGGCCTG
CAGCGGATCA CCATCGTGGC GGCCCTGCCG TTTGTGCTGG TCATGCTGCT GCTGTGCTTC
GCCCTGGTCA AGGACCTGCG CCGGGATCCG CTGTCCCTCC GGCGCCGACT GGCGGACTCG
GTAGTGGAGC GCGCCATTCG CAGCGGCGTG GACCAACACG GCGGCGTCCA GTTCGATCTC
GTTACCAAAC ATCAATGTGA TCAGCGCTGT CCGGACGACG GCCGCTGCGC GGCTTCCCCC
GCTGACACCC AAGCCCCCGC AAAGAATTAG
 
Protein sequence
MPTPPVELEE PKLVPASENS PSSHDSHDAE DTEQIMQELR QTKAEQAVAE RRNRKLTLDK 
ATFGITGVLA LAFVAWGFLG RDSLAATSTA ALDWVMEYTG WLFMVLASLF VVFVLWLALG
KWGNIPLGKD GEKPEFRTVS WIAMMFAAGM GIGLMFYGVA EPLYHYISPP PGTVDGRTPA
AIQTAMATSI FHWTLHPWAM YAVVGIAMAY GTYRLGRRQL ISAAFTSLFG IRTVEGPVGK
FINILAIFAT LFGTAASLGL GALQIGSGLT SNGWIGEVGT PILVAIVAVL TACFVASAVS
GISRGIQWLS NINMVLAVIL ALIVFLAGPT LFILNLIPAA VGDYARDLAE MSSRTEAVGD
EALRTWMSGW TIFYWAWWVS WTPFVGMFIA RISRGRTIRQ FVTGVLLVPS IVSVIWFGIF
GGTAFHVQEE ADKAGTPGLV SMASGSPSID FDGALFDLVR NMSMPAWLTA AVVVLAMVLV
AIFFITGADS ASIIMASLSS NGSSDPKRGL VIFWGLLTGA VAAVMMLAGG DEPSEALSGL
QRITIVAALP FVLVMLLLCF ALVKDLRRDP LSLRRRLADS VVERAIRSGV DQHGGVQFDL
VTKHQCDQRC PDDGRCAASP ADTQAPAKN