Gene Arth_4138 details


Gene Information

Locus tag: Arth_4138
Symbol:
ID: 4447606
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4654508
End bp: 4656007
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 68%
IMG OID: 639691969
Product: phosphoesterase, PA-phosphatase related
Protein accession: YP_833613
Protein GI: 116672680
COG category: [I] Lipid transport and metabolism
              [R] General function prediction only
COG ID: [COG0671] Membrane-associated phospholipid phosphatase
        [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase
TIGRFAM ID:
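
The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the convention under which the listed numbers are consistent); the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4654508
end_bp = 4656007

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# A CDS of N bases holds N // 3 codons; the final codon is the stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1500 499
```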


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAAGGA TGCTGAGGAA AGGCCCGGGC CTGGTGGCCG GACTGGACCG GACATTACTC 
AAAGCCGTGT CCAACCTGCC GGGCGGAAAC CATGATGTCC TGTTCCGCCG GCTCTCGGCG
GCCGCGAACC ATGGAAAGTT GTGGGTTGCG GTGGCCGGAG GGCTGGCCCT CATACCCGGC
AGGCCCCGCC GTGCGGCCTT GCACGGGCTG ATTGCCCAGG GCGTGGCATC GGCCGTCACC
AACGTGGTGT TCAAAACGCT CCTTCCCCGG GCCCGGCCAC TGCCTGAGCA TCTTCCCGTG
TTCCGGTTCG TCAACCCTCA ACCCACCAGC TCTTCGATGC CATCAGGCCA TTCCGCGTCC
GCCGCGGCTT TTGCCGTGGG GGTGGGGCTG GTGCAGCCGG CCATTGGCGT AGCGCTGGCA
CCCCTGGCCG CCGGCGTTGC ATATTCCCGG GTCCACACCG GCGCCCACTG GCCTTCGGAT
GTACTGTTCG GGTCGGCACT GGGAGCCGGA GCGGCCATGA TCACCCGTAA GTGGTGGCCC
GTGCGGCCGC CCTTCCCGGA AACCGCCCGC ACCCCCGCCC ACGCGGCCAC ATTGCCCGGC
GGCGAAGGCC TTAGCATCGC GGTCAACACG CTGGGCGGTT CATATGCGCC GGAAACCATA
AAGGCATTGC AGGATGTATT CCCATCAGCG CATATACATG AGATTGACGA GGACGGGGAC
GTCGCCGCAG AGATCAGGGC GGTGGCCGAC CGGCCCGGAG TAAAGGCGCT CGGCGTCTGG
GGCGGTGACG GAACGGTTGG CACCGCGGCT GCAGTCGCGG TCGAACTTTC CCTGCCCTTG
CTGGTGCTCC CCGGCGGGAC GCTCAACCAC TTTGCCCGCG ACGCCGGAAC CTCCACCCTC
GAGGACGCCG TGGAGGCGGC TTCGGCCGGT GCGGCTGCCC GGGCGGATGT GGGGCACGTC
CGGGTGGAAC GGGGGCTCGC CGGAAGCCCC GAAAGGCTGG AACGGACCAT GCTCAACACG
GCGAGCATTG GCCTCTACCC CAACCTGGTA CGCCGGCGGG AACAGCTCCG GCCGGCCCTC
GGCAAACCGC TGGCCGGAGT GGCCGCAATG TTCCGGACGT TCGCGGCCGG CACGCCGACC
ACCATGATTG TGAACGGCGT CCGCCACAAG CTATGGATTC TTTACGTCGG CCGGGGCCGC
TACTACCCCC GCGACCATGC ACCGCTGGTG CGGCCGGTGA TGGACGACGG CGTCCTGGAC
CTGCGGATGA TCACTGCCGA TGAGTCGTTC GCACGGATCC GGCTCCTGTG GTCCGTCCTG
ACGGGCACGG TGGCCACGTC CAAGATCACG CACTTGAGTG AGTCCACGAA GGTGACGGTG
GAAGCAGCGG GGTCGCCCAT GGCGCTGGCC GTGGACGGGG AAGCCATGCC GGGCGTGCGC
CGCGTTGAGT TTTCGGTGCT CCCCGGAGCG CTGACCTACT ACTCCCCGAC ACCGGCGTAG
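
The GC content reported for this gene is a whole-sequence figure. As a rough sketch of how such a percentage can be computed from the space-separated blocks shown above, the hypothetical helper `gc_percent` below strips whitespace and counts G and C bases. It is applied here to the first row only, so its result need not match the figure for the full gene.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, copied as displayed (blocks of 10 bases).
first_row = "ATGCGAAGGA TGCTGAGGAA AGGCCCGGGC CTGGTGGCCG GACTGGACCG GACATTACTC"
print(f"{gc_percent(first_row):.1f}% GC in the first 60 bases")
```

Passing the full 1500-base sequence as one string (line breaks included) gives the gene-level value.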
 
Protein sequence
MRRMLRKGPG LVAGLDRTLL KAVSNLPGGN HDVLFRRLSA AANHGKLWVA VAGGLALIPG 
RPRRAALHGL IAQGVASAVT NVVFKTLLPR ARPLPEHLPV FRFVNPQPTS SSMPSGHSAS
AAAFAVGVGL VQPAIGVALA PLAAGVAYSR VHTGAHWPSD VLFGSALGAG AAMITRKWWP
VRPPFPETAR TPAHAATLPG GEGLSIAVNT LGGSYAPETI KALQDVFPSA HIHEIDEDGD
VAAEIRAVAD RPGVKALGVW GGDGTVGTAA AVAVELSLPL LVLPGGTLNH FARDAGTSTL
EDAVEAASAG AAARADVGHV RVERGLAGSP ERLERTMLNT ASIGLYPNLV RRREQLRPAL
GKPLAGVAAM FRTFAAGTPT TMIVNGVRHK LWILYVGRGR YYPRDHAPLV RPVMDDGVLD
LRMITADESF ARIRLLWSVL TGTVATSKIT HLSESTKVTV EAAGSPMALA VDGEAMPGVR
RVEFSVLPGA LTYYSPTPA
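
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this for the first and last rows of the gene sequence. It builds the standard codon table by hand; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may serve as starts, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Each displayed row holds 60 bases, so every row starts in frame.
first_row = "ATGCGAAGGA TGCTGAGGAA AGGCCCGGGC CTGGTGGCCG GACTGGACCG GACATTACTC"
last_row = "CGCGTTGAGT TTTCGGTGCT CCCCGGAGCG CTGACCTACT ACTCCCCGAC ACCGGCGTAG"

print(translate(first_row))  # MRRMLRKGPGLVAGLDRTLL
print(translate(last_row))   # RVEFSVLPGALTYYSPTPA*
```

The trailing `*` is the TAG stop codon, which is why the protein is one residue shorter than the number of codons in the gene.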