Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3780 |
Symbol | |
ID | 4447830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4262548 |
End bp | 4263738 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639691604 |
Product | NLPA lipoprotein |
Protein accession | YP_833255 |
Protein GI | 116672322 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATCGC ATCAGAACCC AGAAGGATCC ACCCGCATCG TGGCGGGCCA AAGCAGCAAA GTCCAAAACT CCGCAGCAAA GCGCAAACGC GCCCTCGGAA TCGGCATCGC GGCCGGGCTC GTCGCCCTGA TCGCCGGCGG CGCGGCAGTG GCGTCGAACC TCGCCCGCAG CACCGAATCA CAGCCGGCCG CAGCAGCTGC CACCGGCACG CCTGCCGCGG AGTTGAAGCT TGGCTTCTTC GGCAACGTCA CGCACGCCCC GGCGCTGGTG GGGGTGAAGG AAGGTTTCAT TGCCGGGAGC CTGGGCGGGA CGAAGCTGAG CACGCAGGTG TTCAATTCCG GTCCGGCCGC GATCGAGGCG CTGAACGCCG GCGCGATCGA CGCCACGTAT ATCGGCCCGA ACCCGGCGAT CAACTCCTTC GTGAAGAGCC GGGGCGAGTC AGTGAGCATC ATTGCCGGCG CCGCGGCGGG CGGCGCCCAG CTGGTGGTGA AGCCGGAGAT CGGCTCGGCC GCGGATCTGA GGGGCAAGAC CCTGTCTACG CCGCAGCTGG GCGGGACCCA GGACGTGGCG CTGCGCGCCT GGCTCGCCGG GCAGGGGTAC AAGACGAACA CGGACGGCAG CGGGGATGTG GCGATCAACC CGACCGAGAA CGCGCAGACG CTGAAGCTGT TCCAGGACGG CAAGCTCGAC GGCGCGTGGC TGCCGGAACC GTGGGCGTCC CGGCTGGTGC TGCAGGCCGG CGCGAAGGTC CTGGTGGACG AGAAGGATTT GTGGGACGGG TCGCTGACGG GCAAGCCAGG CGAGTTCCCC ACCACCATCC TGATCGTGAA CAAGAAGTTC GCCGCTGACC ACCCGGACAC CGTCAAGGCC CTGCTGAAGG GCCACGCCGA GTCCGTGGCC TGGCTCAACT CCGCGGCCGC CGCCGAGAAG TCCACTGTCA TCAATGCCGC CCTCAAGGAA GCGTCCGGCG CCGAGCTGAA AGCCGACGTC ATTGAACGGT CCCTGAAGAA CATCGTCTTC ACCGTGGATC CGCTGGCCGG AACCTACAAA AAGCTGCTTG AGGACGGGGT GAAGGCCGGC ACCACCAAGC AGGCGGACAT CACCGGCATC TTCGACCTCA CCGCCCTGAA CAGCGTCACC GCCGAAACAG GCGGCAGTAA GGTCTCCGCC GCCGGACTCG GCACGGACTG A
|
Protein sequence | MSSHQNPEGS TRIVAGQSSK VQNSAAKRKR ALGIGIAAGL VALIAGGAAV ASNLARSTES QPAAAAATGT PAAELKLGFF GNVTHAPALV GVKEGFIAGS LGGTKLSTQV FNSGPAAIEA LNAGAIDATY IGPNPAINSF VKSRGESVSI IAGAAAGGAQ LVVKPEIGSA ADLRGKTLST PQLGGTQDVA LRAWLAGQGY KTNTDGSGDV AINPTENAQT LKLFQDGKLD GAWLPEPWAS RLVLQAGAKV LVDEKDLWDG SLTGKPGEFP TTILIVNKKF AADHPDTVKA LLKGHAESVA WLNSAAAAEK STVINAALKE ASGAELKADV IERSLKNIVF TVDPLAGTYK KLLEDGVKAG TTKQADITGI FDLTALNSVT AETGGSKVSA AGLGTD
|
| |