Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3117 |
Symbol | |
ID | 4444350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3497692 |
End bp | 3498849 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639690944 |
Product | NLPA lipoprotein |
Protein accession | YP_832596 |
Protein GI | 116671663 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAGTC CCCAGCCCGG AATGATCCGC ATCGTGGCAG GAGAGAGCAA CAACCCGAAG CGCAAGCGCG CCGTGGAGGT CCTGGTTGCC CTCGGCCTGG TGCTGCTTAT TGCCGCCGGT GCCGTGGTGG CGTCAACCCT TTCCCGCGAT TCCTCAGCCC AGGCAGCCGC CCCCGCGCCT GCCGCGGAGT TGAAGCTTGG CTTCTTCAGC AATGTCACGC ACGCCCCGGC GCTGGTGGGG GTGAAGGAAG GTTTCATTGC CGGGAGCCTG GGCGGGACGA AGCTGAGCAC GCAGGTGTTC AATGCCGGTC CGGCCGCGAT CGAGGCGCTG AACGCCGGCG CGATCGACGC CACGTATATC GGCCCGAACC CGGCGATCAA CTCCTTCGTG AAGAGCCGGG GCGAGTCCGT GAGCATCATT GCCGGCGCCG CGGCGGGCGG CGCCCAGCTG GTGGTGAAGC CGGAGATCGG CTCGGCCGCG GATTTGAGGG GCAAGACCCT GTCTACGCCG CAGCTGGGCG GGACCCAGGA CGTGGCGCTG CGCGCCTGGC TCGCCGGGCA GGGGTACAAG ACGAACACGG ACGGCAGCGG GGATGTGGCG ATCAACCCGA CCGAGAACGC GCAGACGCTG AAGCTGTTCC AGGACGGCAA GCTCGACGGC GCGTGGCTGC CGGAACCGTG GGCGTCCCGG CTGGTGCTGC AGGCCGGCGC GAAGGTCCTG GTGGACGAGA AGGATTTGTG GGACGGGTCG CTGACGGGCA AGCCGGGCGA GTTCCCCACC ACCATCCTGA TCGTGAACAA GAAGTTCGCC GCTGACCACC CGGACACCGT CAAGGCCCTG CTGAAGGGCC ACGCCGAGTC CGTGGCCTGG CTCAACTCCG CGGCCGCCGC GGAAAAGGCG GGCGTGCTGA ATGCCGCGCT GAAGGAATTC GGAAGCGCCG AACTCCCGGC CGACGTCATT GAGCGGTCCC TGAAGAACAT CGTCTTCACC GTGGATCCGC TGGCCGGAAC CTACAAAAAG CTGCTTGAGG ACGGGGTGAA GGCCGGTACC ACCAAGCAGG CGGACATCAC CGGCATCTTC GACCTCACCG CCCTGAACAG CGTCACCGCC GAAACAGGCG GCAGCAAGGT CTCCGCCGCC GGACTCGGCA ACGACTAA
|
Protein sequence | MTSPQPGMIR IVAGESNNPK RKRAVEVLVA LGLVLLIAAG AVVASTLSRD SSAQAAAPAP AAELKLGFFS NVTHAPALVG VKEGFIAGSL GGTKLSTQVF NAGPAAIEAL NAGAIDATYI GPNPAINSFV KSRGESVSII AGAAAGGAQL VVKPEIGSAA DLRGKTLSTP QLGGTQDVAL RAWLAGQGYK TNTDGSGDVA INPTENAQTL KLFQDGKLDG AWLPEPWASR LVLQAGAKVL VDEKDLWDGS LTGKPGEFPT TILIVNKKFA ADHPDTVKAL LKGHAESVAW LNSAAAAEKA GVLNAALKEF GSAELPADVI ERSLKNIVFT VDPLAGTYKK LLEDGVKAGT TKQADITGIF DLTALNSVTA ETGGSKVSAA GLGND
|
| |