Gene Information |
Locus tag | Arth_4138 |
Symbol | |
ID | 4447606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 4654508 |
End bp | 4656007 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639691969 |
Product | phosphoesterase, PA-phosphatase related |
Protein accession | YP_833613 |
Protein GI | 116672680 |
COG category | [I] Lipid transport and metabolism; [R] General function prediction only |
COG ID | [COG0671] Membrane-associated phospholipid phosphatase; [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase |
TIGRFAM ID | |
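
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch makes explicit. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(4654508, 4656007)
print(length_bp, protein_length(length_bp))  # prints "1500 499"
```

Both results agree with the Gene Length and Protein Length fields of the record.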
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAGGA TGCTGAGGAA AGGCCCGGGC CTGGTGGCCG GACTGGACCG GACATTACTC AAAGCCGTGT CCAACCTGCC GGGCGGAAAC CATGATGTCC TGTTCCGCCG GCTCTCGGCG GCCGCGAACC ATGGAAAGTT GTGGGTTGCG GTGGCCGGAG GGCTGGCCCT CATACCCGGC AGGCCCCGCC GTGCGGCCTT GCACGGGCTG ATTGCCCAGG GCGTGGCATC GGCCGTCACC AACGTGGTGT TCAAAACGCT CCTTCCCCGG GCCCGGCCAC TGCCTGAGCA TCTTCCCGTG TTCCGGTTCG TCAACCCTCA ACCCACCAGC TCTTCGATGC CATCAGGCCA TTCCGCGTCC GCCGCGGCTT TTGCCGTGGG GGTGGGGCTG GTGCAGCCGG CCATTGGCGT AGCGCTGGCA CCCCTGGCCG CCGGCGTTGC ATATTCCCGG GTCCACACCG GCGCCCACTG GCCTTCGGAT GTACTGTTCG GGTCGGCACT GGGAGCCGGA GCGGCCATGA TCACCCGTAA GTGGTGGCCC GTGCGGCCGC CCTTCCCGGA AACCGCCCGC ACCCCCGCCC ACGCGGCCAC ATTGCCCGGC GGCGAAGGCC TTAGCATCGC GGTCAACACG CTGGGCGGTT CATATGCGCC GGAAACCATA AAGGCATTGC AGGATGTATT CCCATCAGCG CATATACATG AGATTGACGA GGACGGGGAC GTCGCCGCAG AGATCAGGGC GGTGGCCGAC CGGCCCGGAG TAAAGGCGCT CGGCGTCTGG GGCGGTGACG GAACGGTTGG CACCGCGGCT GCAGTCGCGG TCGAACTTTC CCTGCCCTTG CTGGTGCTCC CCGGCGGGAC GCTCAACCAC TTTGCCCGCG ACGCCGGAAC CTCCACCCTC GAGGACGCCG TGGAGGCGGC TTCGGCCGGT GCGGCTGCCC GGGCGGATGT GGGGCACGTC CGGGTGGAAC GGGGGCTCGC CGGAAGCCCC GAAAGGCTGG AACGGACCAT GCTCAACACG GCGAGCATTG GCCTCTACCC CAACCTGGTA CGCCGGCGGG AACAGCTCCG GCCGGCCCTC GGCAAACCGC TGGCCGGAGT GGCCGCAATG TTCCGGACGT TCGCGGCCGG CACGCCGACC ACCATGATTG TGAACGGCGT CCGCCACAAG CTATGGATTC TTTACGTCGG CCGGGGCCGC TACTACCCCC GCGACCATGC ACCGCTGGTG CGGCCGGTGA TGGACGACGG CGTCCTGGAC CTGCGGATGA TCACTGCCGA TGAGTCGTTC GCACGGATCC GGCTCCTGTG GTCCGTCCTG ACGGGCACGG TGGCCACGTC CAAGATCACG CACTTGAGTG AGTCCACGAA GGTGACGGTG GAAGCAGCGG GGTCGCCCAT GGCGCTGGCC GTGGACGGGG AAGCCATGCC GGGCGTGCGC CGCGTTGAGT TTTCGGTGCT CCCCGGAGCG CTGACCTACT ACTCCCCGAC ACCGGCGTAG
|
Protein sequence | MRRMLRKGPG LVAGLDRTLL KAVSNLPGGN HDVLFRRLSA AANHGKLWVA VAGGLALIPG RPRRAALHGL IAQGVASAVT NVVFKTLLPR ARPLPEHLPV FRFVNPQPTS SSMPSGHSAS AAAFAVGVGL VQPAIGVALA PLAAGVAYSR VHTGAHWPSD VLFGSALGAG AAMITRKWWP VRPPFPETAR TPAHAATLPG GEGLSIAVNT LGGSYAPETI KALQDVFPSA HIHEIDEDGD VAAEIRAVAD RPGVKALGVW GGDGTVGTAA AVAVELSLPL LVLPGGTLNH FARDAGTSTL EDAVEAASAG AAARADVGHV RVERGLAGSP ERLERTMLNT ASIGLYPNLV RRREQLRPAL GKPLAGVAAM FRTFAAGTPT TMIVNGVRHK LWILYVGRGR YYPRDHAPLV RPVMDDGVLD LRMITADESF ARIRLLWSVL TGTVATSKIT HLSESTKVTV EAAGSPMALA VDGEAMPGVR RVEFSVLPGA LTYYSPTPA
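
The sequences above are displayed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The sketch below shows one way to do that and to compute a GC fraction and a basic CDS sanity check. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_cds`) are hypothetical, and the start-codon set is a simplifying assumption that covers only the most common bacterial initiators, not every start codon permitted by translation table 11.

```python
# Stop codons under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: only the most common bacterial start codons are checked here.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Basic check: length divisible by 3, common start codon, stop codon at end."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Short excerpt only (the first two blocks of the gene sequence above);
# paste the full "Gene sequence" field here to analyse the whole gene.
excerpt = clean_sequence("ATGCGAAGGA TGCTGAGGAA")
print(len(excerpt), gc_fraction(excerpt))
```

Run on the full gene sequence, `gc_fraction` can be compared against the GC content field, and `looks_like_cds` against the Type field.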