Gene Arth_1057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1057 
Symbol 
ID4446457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1136171 
End bp1137811 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content69% 
IMG OID639688860 
Productalkaline phosphatase 
Protein accessionYP_830551 
Protein GI116669618 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3540] Phosphodiesterase/alkaline phosphatase D 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAGCA TGACTGCTTT CACCCGCCGT AATGTCCTCA AAGGTGCCCT GGCAGCCGCC 
GGTGCCGCCG CCGTCGTGCC CGCCGCGCTT TCGACAGGCT CAACCCCCGC TCACGCCGGC
GTCGCGCTGG TCCGCAACCG ACTGACCCTC CCGTCCGGGA TCGCCACCGG CGATGTCACC
GCCGATTCCG GCGTGCTGTG GTCCCGCGCG TCGGGCCCGG GCCGGCTGGT GGCCGGCCTG
CTCGCCGTGG ACGACGACGG TGCCCCCCTC CGCGGCGGCC GCGCCTTCCA GCGCGTGCTG
CGGGGGAGCG CCGCCAGCGA AGCCACCGAC TTCACGTCCC GGATCAATGC CGAGCACCTC
CCCGCCGGCA CCCGCTTCGC GCTCACCCTG CACTTCGAGG ACGCCGAGGG CAACGCCGGC
GAAACCGCGC AGGGCTCCTT CAGGACCGCC CCCGGAACCG GACTCATCAG CAGCGGCCGT
GCAGCCCGCA AGCAGAGCTT CGTCTGGACC GGCGACACCG CCGGCCAGGG TTGGGGCATC
AACGAGGAGA TCGGCGGCAT GCGCGGCTAC GCGGCCATGC ACGCCACCAG GCCGGACTTC
TTCATCCACT CCGGCGACAC GATCTACGCC GACGGCCCCA TCGCCGCGCA GGTCACCGAG
CCGGACGGGC AGATCTGGCG CAACCTCGTC ACCGAAGAAG TGTCCAAGGT GGCCGAGACC
CTCAACGAGT TCCGCGGACG GCACCGTTAC AACCTCATGG ACCGGAACGT CCGCGCCCTG
TACGCCGAGG TGCCGGTGAT CGCCCAGTGG GACGACCACG AAACGCACAA CAACTGGTAC
CCCGGCGAGG TCATCACCGA CCCCCGCTAC ACCGAACGCC GCGTGGACGT GCTCGCCGCC
CGCGGCCGCC AGGCCTGGCA GGAATACCAG CCCGTCTCCG GCCTCAGCAC CCGGATCGGC
GACGGCAGCA CGGGCTTTGA ACCCGCCCGG ATCTACCGCA AGATCTCCCG CGGCCCCCAG
CTGGACGTGT TCTGCCTGGA CATGCGCACC TTCAAGGACC CCAACACCGA CGGCAAGGAA
ACGCACCTCA CCCACATCCT GGGCCAGGAA CAGGCGGAAT GGCTCATCCG CGAAGTCAGC
AAGTCCAAGG CCACCTGGAA GGTCATTTCC GCCGACCTTC CGCTTGGCCT GATCGTCCCG
GACGGCCCGG TCAACCAGGA GTCCCTGGCC AACCGCGACG CCGGCGCCCC CTTGGGCAAG
GAACTCGAGA TCGCGGGGGT GCTCTCGGCG TTCAAGCGCA ACCGGGTTAA AAACGTGGTG
TGGCTCACGG CCGACGTCCA CTACTGCGCC GCCCACCACT ACTCGCCCGA GCGCGCCGCG
TTCACCGATT TCGACTCCTT CTGGGAATTC GTGGCCGGCC CCATCAACGC CGGCAGCTTC
GGCCCCGGCG AACTGGACGG CACCTTCGGT CCTGAGCGGG TGTTCTACAA GACCGGCGCC
TACGCCAACC AGTCCCCGCG CACCGGAGAA AACCAGTTCT TCGGCCACGT GGACCTGAAC
GAGGACGACG TCTTCACCGT CAGCCTCCGC AACGCCAACG GCACCGTCCT CTGGAACAAG
GAGCTCCAGC CGGCGCGGTA G
 
Protein sequence
MGSMTAFTRR NVLKGALAAA GAAAVVPAAL STGSTPAHAG VALVRNRLTL PSGIATGDVT 
ADSGVLWSRA SGPGRLVAGL LAVDDDGAPL RGGRAFQRVL RGSAASEATD FTSRINAEHL
PAGTRFALTL HFEDAEGNAG ETAQGSFRTA PGTGLISSGR AARKQSFVWT GDTAGQGWGI
NEEIGGMRGY AAMHATRPDF FIHSGDTIYA DGPIAAQVTE PDGQIWRNLV TEEVSKVAET
LNEFRGRHRY NLMDRNVRAL YAEVPVIAQW DDHETHNNWY PGEVITDPRY TERRVDVLAA
RGRQAWQEYQ PVSGLSTRIG DGSTGFEPAR IYRKISRGPQ LDVFCLDMRT FKDPNTDGKE
THLTHILGQE QAEWLIREVS KSKATWKVIS ADLPLGLIVP DGPVNQESLA NRDAGAPLGK
ELEIAGVLSA FKRNRVKNVV WLTADVHYCA AHHYSPERAA FTDFDSFWEF VAGPINAGSF
GPGELDGTFG PERVFYKTGA YANQSPRTGE NQFFGHVDLN EDDVFTVSLR NANGTVLWNK
ELQPAR