Gene Saro_2971 details

Gene Information

Locus tag: Saro_2971
Symbol:
ID: 3917406
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3191032
End bp: 3192723
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 69%
IMG OID: 640445749
Product: protein serine/threonine phosphatases
Protein accession: YP_498240
Protein GI: 87200983
COG category: [K] Transcription
              [L] Replication, recombination and repair
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
        [COG0631] Serine/threonine protein phosphatase
TIGRFAM ID:
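
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the reported protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3191032
end_bp = 3192723

# Assumption: coordinates are inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1692
print(protein_length)   # 563
```

Under these assumptions the computed values agree with the listed gene length (1692 bp) and protein length (563 aa).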


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.59777
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGGAA GTGGCGAACT CGTGGTGGCG GCAGGCTTCT CGAGCCTGAC CGGACCGCGA 
GCCGACAATC AGGATTTCGG CGGCGTGCAT CTGGGCACGG CGCTGGAACG GGCCCGCCAC
GGTGCGGTGG CGGCCATTGC CGATGGCGTG TCGGGCGGGC GCGAGGGGCG GGCAGCAGCG
GAACTGGCGG TTCGCGCGCT GATCGAAGGC TTCTATGCGA TGCCCGGCAC GCTGGGGCCA
GCGCGCGCGA TGCAGGCGCC GCTCGCCGCC TACAACCGCT GGCTTCACGC CCAGGGCCGC
GGCGAAACGA TGGCGAACAG TGCCACGACA TTCACCGCAA TTGCGCTGCG CGGTCGCCGC
GCCCACCTGG TCCACGTCGG CGACAGCCGC GCGTGGCGCT ACTCCGGCGG GCGCCTGACC
TGCCTCACCG CCGACCATAC GCGCCCCGAA CCCGATCTGA ATCACGTGCT CATCCGCGCG
CTCGGGATCG AACCCGAACT GCGGCTCGAC CACTCCGATC TCGAACTTGC CGAGCACGAC
CGCCTGGTCC TCACGACCGA CGGCATCCAC GCCGTCCTTT CGGCAAAGCG GATCGCCGCC
ATCCTTGCCG AAAGCGCCAG CGCAGAAGCG ACCGCCGAAG CGCTTGCCGA AGCCGCCATC
GCCGCGGGCG GCCGCGACAA TGCCACAGCC GTCGTCCTCG ACATCGTCCG ACTGCCCGCC
CCCGATCACG ACGGCATCCT CGCAGGCCTC TCCGCCCTGC CCTTCGCCGA CCCGCCCCGC
CCCGGCGAAA GCATTGACGG CTTTCGCGCC GAACGCATCC TCTCCGAAGG GCGCCATGCC
GTCCTTCTGA TCGCCACCGA TTGCGAGGAC GGTAGCCGCG TGGTGCTGAA GTTCCCGCGC
CGAGAAATCC TGTCCGACCG GGCGCTCCGC CTGGCGTTCG CACGCGAAAT GCTGCTCGCC
CAGCGTGTAT CGAGCCCGTT CATCCTGGCA GCGCACCCGG TCCGGCCAGA TCGCCAGAGC
GCGCTCTACG GCGTCCAGCC GTTTCTCGAA GGCGAGACCA TGGCCCAGCG GCTGGAACGC
GGCCTGCCCT CGCTGCGCAC CGCGCTCGAC ACAGCGATCA AGCTGACGCG CGGGGTTGCT
GCCCTGCATC GCCTTGAAGT CGTCCACCGC GACATCAAGC CCGACAATGC GATCCTGACT
GCCGACGGCG GCCTGCGCCT CATCGATCTC GGCGTCGCGC GACTGCCGAA GGTCGAGGAT
TTCCACGCCG ACGAAATCCC CGGAACGCCG GGTTTCATGG CCCCCGAACA GTTCGAAGGC
AACGCCGGCG ATGCCCTGAC CGACCAGTTT GCGCTCGGCG TCACCCTCTA CCGCTGGTTC
ACCGGCAAGT GGCCGTTTGG CGAACAGGAG GCCTTCCAGC GGCCGCGATT CAACCGCCCT
GCACCGCCCT CTCGCCACCG TCCGGAAATT CCCAGCTGGC TCGATGACGC GATCCTCACC
GCGATCCAGC CCGACCGGGA CAAGCGTTTC GGTGACGTGA TCGAACTCCT GCGTGCGCTC
GAAGGCGGCG GAACGCTTGC CAGCGGCCCG AAGCGGGACA TACCCCTGAT CGAGCGGGAT
CCGGTCCGTT TCTGGCAGAT CGTCAGCGCA TTGCTCGGCG CGGCGCTGAT CGCCTCGTTG
CTTCTGCGCT GA
 
Protein sequence
MRGSGELVVA AGFSSLTGPR ADNQDFGGVH LGTALERARH GAVAAIADGV SGGREGRAAA 
ELAVRALIEG FYAMPGTLGP ARAMQAPLAA YNRWLHAQGR GETMANSATT FTAIALRGRR
AHLVHVGDSR AWRYSGGRLT CLTADHTRPE PDLNHVLIRA LGIEPELRLD HSDLELAEHD
RLVLTTDGIH AVLSAKRIAA ILAESASAEA TAEALAEAAI AAGGRDNATA VVLDIVRLPA
PDHDGILAGL SALPFADPPR PGESIDGFRA ERILSEGRHA VLLIATDCED GSRVVLKFPR
REILSDRALR LAFAREMLLA QRVSSPFILA AHPVRPDRQS ALYGVQPFLE GETMAQRLER
GLPSLRTALD TAIKLTRGVA ALHRLEVVHR DIKPDNAILT ADGGLRLIDL GVARLPKVED
FHADEIPGTP GFMAPEQFEG NAGDALTDQF ALGVTLYRWF TGKWPFGEQE AFQRPRFNRP
APPSRHRPEI PSWLDDAILT AIQPDRDKRF GDVIELLRAL EGGGTLASGP KRDIPLIERD
PVRFWQIVSA LLGAALIASL LLR
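
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or GC-content check. The sketch below shows one way to do that in plain Python. The helper names (`clean_sequence`, `gc_fraction`, `ends_with_stop`) are hypothetical, and the stop codons used are those of translation table 11 (TAA, TAG, TGA). Only the first two blocks of the gene sequence are used here as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """Check whether the last codon is a stop codon (translation table 11)."""
    return seq[-3:] in {"TAA", "TAG", "TGA"}


# Demonstration on the first two blocks of the gene sequence only.
first_blocks = clean_sequence("ATGCGTGGAA GTGGCGAACT \n")
print(first_blocks)               # ATGCGTGGAAGTGGCGAACT
print(gc_fraction(first_blocks))  # 0.55
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would allow a comparison with the listed GC content, and `ends_with_stop` can confirm that the sequence terminates in a stop codon.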