Gene Arth_3117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3117 
Symbol 
ID4444350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3497692 
End bp3498849 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content67% 
IMG OID639690944 
ProductNLPA lipoprotein 
Protein accessionYP_832596 
Protein GI116671663 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAGTC CCCAGCCCGG AATGATCCGC ATCGTGGCAG GAGAGAGCAA CAACCCGAAG 
CGCAAGCGCG CCGTGGAGGT CCTGGTTGCC CTCGGCCTGG TGCTGCTTAT TGCCGCCGGT
GCCGTGGTGG CGTCAACCCT TTCCCGCGAT TCCTCAGCCC AGGCAGCCGC CCCCGCGCCT
GCCGCGGAGT TGAAGCTTGG CTTCTTCAGC AATGTCACGC ACGCCCCGGC GCTGGTGGGG
GTGAAGGAAG GTTTCATTGC CGGGAGCCTG GGCGGGACGA AGCTGAGCAC GCAGGTGTTC
AATGCCGGTC CGGCCGCGAT CGAGGCGCTG AACGCCGGCG CGATCGACGC CACGTATATC
GGCCCGAACC CGGCGATCAA CTCCTTCGTG AAGAGCCGGG GCGAGTCCGT GAGCATCATT
GCCGGCGCCG CGGCGGGCGG CGCCCAGCTG GTGGTGAAGC CGGAGATCGG CTCGGCCGCG
GATTTGAGGG GCAAGACCCT GTCTACGCCG CAGCTGGGCG GGACCCAGGA CGTGGCGCTG
CGCGCCTGGC TCGCCGGGCA GGGGTACAAG ACGAACACGG ACGGCAGCGG GGATGTGGCG
ATCAACCCGA CCGAGAACGC GCAGACGCTG AAGCTGTTCC AGGACGGCAA GCTCGACGGC
GCGTGGCTGC CGGAACCGTG GGCGTCCCGG CTGGTGCTGC AGGCCGGCGC GAAGGTCCTG
GTGGACGAGA AGGATTTGTG GGACGGGTCG CTGACGGGCA AGCCGGGCGA GTTCCCCACC
ACCATCCTGA TCGTGAACAA GAAGTTCGCC GCTGACCACC CGGACACCGT CAAGGCCCTG
CTGAAGGGCC ACGCCGAGTC CGTGGCCTGG CTCAACTCCG CGGCCGCCGC GGAAAAGGCG
GGCGTGCTGA ATGCCGCGCT GAAGGAATTC GGAAGCGCCG AACTCCCGGC CGACGTCATT
GAGCGGTCCC TGAAGAACAT CGTCTTCACC GTGGATCCGC TGGCCGGAAC CTACAAAAAG
CTGCTTGAGG ACGGGGTGAA GGCCGGTACC ACCAAGCAGG CGGACATCAC CGGCATCTTC
GACCTCACCG CCCTGAACAG CGTCACCGCC GAAACAGGCG GCAGCAAGGT CTCCGCCGCC
GGACTCGGCA ACGACTAA
 
Protein sequence
MTSPQPGMIR IVAGESNNPK RKRAVEVLVA LGLVLLIAAG AVVASTLSRD SSAQAAAPAP 
AAELKLGFFS NVTHAPALVG VKEGFIAGSL GGTKLSTQVF NAGPAAIEAL NAGAIDATYI
GPNPAINSFV KSRGESVSII AGAAAGGAQL VVKPEIGSAA DLRGKTLSTP QLGGTQDVAL
RAWLAGQGYK TNTDGSGDVA INPTENAQTL KLFQDGKLDG AWLPEPWASR LVLQAGAKVL
VDEKDLWDGS LTGKPGEFPT TILIVNKKFA ADHPDTVKAL LKGHAESVAW LNSAAAAEKA
GVLNAALKEF GSAELPADVI ERSLKNIVFT VDPLAGTYKK LLEDGVKAGT TKQADITGIF
DLTALNSVTA ETGGSKVSAA GLGND