Gene Sala_2787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2787 
Symbol 
ID4080368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2948318 
End bp2951125 
Gene Length2808 bp 
Protein Length935 aa 
Translation table11 
GC content66% 
IMG OID638011171 
ProductPII uridylyl-transferase 
Protein accessionYP_617825 
Protein GI103488264 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2844] UTP:GlnB (protein PII) uridylyltransferase 
TIGRFAM ID[TIGR01693] [Protein-PII] uridylyltransferase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.565994 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.190561 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGGAC ACTGGCGCTG GCGCGGCGGG CACCCTAAAC AGGGGGCGAT GGCCCATCCC 
TTTCACTCGC TTTCGAACTG GCGCGCGATC ATCGACCGCC GTGCGCTTGC GGGACAGCTC
GATACGATCG CCGCCGAAAC ACAGGATGCG GCCGCGCGCC GCCGCGCGAT GGTCGTGCTG
CTCAGAGCGG CGCTCGAGGA CGGCCGCGCG GAGGTCGAGC GCCGCCTGCT CGCGCATCCC
TCGTCGGGGC GCGTCGCGGC GCAGACGACC GCTTTCCTGA TCGACCAGCT CGTCCGGCTC
AGCCACGACT TCACCGTTGA GCATCTTTAC CCGGCGAACA ACCGCTCGGC GGGCGAGCGC
ATCACCTTGA TCGCGGTCGG CGGCTATGGC CGCGGCGAAA TGGCGCCCCA CAGCGACATC
GACATCGGCT TCCTCACCCC GTTCAAGCAG ACGAGCTGGA CCGAACAGGC GATCGAGGCG
CAGCTCTATA CGCTCTGGGA TCTGGGGCTG AAGGTCGGCC ATTCCAGCCG TTCGCTCGAC
GAAATGGTAC GCGCCGCCAA GGATGACCTC ACCATCCGCA CCGCGCTGCT CGAAGGGCGC
TTCATCTGGG GCGACCGCGA GCTTTACGAT CAGGCCGCGG CGCGCTTCGA CGCCGAGGTG
GTCGCGGGCA ATGCGCGCGC CTTCGTCGCC GAAAAGCTTG CGGAACGCGA CGAGCGCCAC
AAACGCATGG GGGATTCGCG GTACGTCGTC GAACCCAATG TGAAGGAGGG CAAGGGCGGG
CTGCGCGACC TCCACACCTT GTTCTGGATC GGCAAGTTCA TCCACCGCGT ACGCACGGTG
CCCGAGCTGG TCGATGCCGG GCTGCTGACG GCGCGCGAGC TCAAGCAGTT CGCGCGCGCC
GAAAATTTCC TGCTCGCGGT GCGCTGCCAC CTCCATGTCC TCGCGGGCCG CGCCGAGGAC
CGGCTGACCT TCGATTTTCA GCGCGAGATC GCGAGCCGCA TGAAATTCGC CGACCGCCCC
GGCAAGAGCG CGGTCGAACG CTTCATGCAA CTCTATTTCC TCCATGCCAA AAGCGTCGGC
GACCTGACCG GCACCTTCCT TGCGCATCTC GACGACCAGA TGGCGGCGCG TGGCCGCCGC
TTCCTCCCGT CGATCCGGCG GCGGCCGGGC AAGCTCAATG GCTTTGTCCT CGACCGTGGC
CGCCTCGCGC TGCCGTCGGA CGATTTTTTC GCCGCCGACC CGGTGCGGCT GATCGAGATT
TTCGCGCTTG CCGACAAGCA CGGTCTTGAA ATCCACCCGC AGGCGATGCG TCAGGCGCGC
CACGACGCGA AGCTGATCGA GACGCAGGGC GTACGCCGCG ACGCGCGCGC CAACGCGCTG
TTCCTCGACG TGCTCACCAG TCCGCGCGAC CCCGAAACGG TGCTGCGCTG GATGAACGAG
GCGGGGGTGT TCGGCCGCTT CGTGCCCGAT TTCGGCCGCG TCGTCGCGCA GATGCAGTTC
GACATGTATC ACCATTATAC CGTCGACGAA CATACGATCC GCGCGATCGG GCTGCTCGCC
GACATCGAAC AGGGGCGGCT GAAGGAGGAC CACCCGCTTT CGACCGCGAT CATGGGGCAG
ATCCATTCGC GGCGCGTCGT CTATGTCGCG GTGCTGCTGC ACGACATCGC CAAGGGGCGC
GGCGGCGACC ACAGCGTGCT CGGCGCCGAA CTGGCTTTGC GCGTATGCCC GCGGCTGGGT
TTGAGCGAGG CGGAGACCGA GACCGTGTCA TGGCTCGTGC GCTATCACCT GCTCATGTCG
GCGACCGCGT TCAAGCGTGA CCTCGCCGAT TTCAAGACGA TCCTCGACTT CGCGCAAATC
GTGCAAAGCC CCGAACGGCT CCGCCTGCTG CTCGTCCTCA CCGTCGTCGA TATCCGCGCG
GTCGGCCCCG GCGTGTGGAA CAGCTGGAAA CGGCAATTGC TCAGCGAACT CTACGACGCG
GCCGAGGAAG TGCTGCGCCT CGGCCACAAG CAGAAGGGCC GCGAACAGCG CATCGCGAGC
AAGAAGGAAG CGGTCGCGGC GCAGTTCGGC TTCGACCGCA AGACGTTCGA CAAGGTGGCA
AAGCGCCTGC CCGAAAGTTA CTGGATCGCC GAGCCGGTCG AGGTCATCGC CGCCAACCTC
GTCCATATCC GCCAGGCGGG CGACGCACCG CTGCACATCG CCGCGGTCCC CGACGATGAC
CGCGGCGCGA CGCTGGTGAT GGTACTTGCG GCCGACCATC CGGGGCTCTT TTATCGCATC
GCGGGGGGCA TTCATCTGGC CGGCGGCAAT ATCATCGACG CGCGCATCCA CACGACGCGC
GACGGCCTCG CGCTCGACAA TTTTCTGGTG CAGGATCCGC TCGGTCGTCC CTTCGCCGAA
ACGGGGCAGA TCGCCCGGTT GACCCGCGCG ATCGAGGACG CGCTCGCCAA CCGCCAGAAA
CTGCTTCCCA AACTCGAAGC CCGCGCCCTC CCGCGCACTC GCGCCGAGGC GTTCCGCGTC
GCCCCCAATG TCTTCGTCGA CAACAAGGCG TCGAACCGGT TCACCGTGAT CGAGGTCAAT
GCGCAGGACC GCCCCGCGCT GCTCAACCAG CTCGCCTATG CGCTCTTCCA GTCGAAGGTC
ACGGTCCACA GCGCCCATGT TGCCACCTAT GGCGAACGCG CGGTCGACAC CTTTTATGTC
ACCGACCTGA TCGGCGACAA GATCGACAGC CCGGCGCGGG TCAAGACATT GGAGAAACGC
CTGCTGGAAG CGGCAACCAG TCAGAGCGAG GAAGTGGTGG CGGCTTAG
 
Protein sequence
MHGHWRWRGG HPKQGAMAHP FHSLSNWRAI IDRRALAGQL DTIAAETQDA AARRRAMVVL 
LRAALEDGRA EVERRLLAHP SSGRVAAQTT AFLIDQLVRL SHDFTVEHLY PANNRSAGER
ITLIAVGGYG RGEMAPHSDI DIGFLTPFKQ TSWTEQAIEA QLYTLWDLGL KVGHSSRSLD
EMVRAAKDDL TIRTALLEGR FIWGDRELYD QAAARFDAEV VAGNARAFVA EKLAERDERH
KRMGDSRYVV EPNVKEGKGG LRDLHTLFWI GKFIHRVRTV PELVDAGLLT ARELKQFARA
ENFLLAVRCH LHVLAGRAED RLTFDFQREI ASRMKFADRP GKSAVERFMQ LYFLHAKSVG
DLTGTFLAHL DDQMAARGRR FLPSIRRRPG KLNGFVLDRG RLALPSDDFF AADPVRLIEI
FALADKHGLE IHPQAMRQAR HDAKLIETQG VRRDARANAL FLDVLTSPRD PETVLRWMNE
AGVFGRFVPD FGRVVAQMQF DMYHHYTVDE HTIRAIGLLA DIEQGRLKED HPLSTAIMGQ
IHSRRVVYVA VLLHDIAKGR GGDHSVLGAE LALRVCPRLG LSEAETETVS WLVRYHLLMS
ATAFKRDLAD FKTILDFAQI VQSPERLRLL LVLTVVDIRA VGPGVWNSWK RQLLSELYDA
AEEVLRLGHK QKGREQRIAS KKEAVAAQFG FDRKTFDKVA KRLPESYWIA EPVEVIAANL
VHIRQAGDAP LHIAAVPDDD RGATLVMVLA ADHPGLFYRI AGGIHLAGGN IIDARIHTTR
DGLALDNFLV QDPLGRPFAE TGQIARLTRA IEDALANRQK LLPKLEARAL PRTRAEAFRV
APNVFVDNKA SNRFTVIEVN AQDRPALLNQ LAYALFQSKV TVHSAHVATY GERAVDTFYV
TDLIGDKIDS PARVKTLEKR LLEAATSQSE EVVAA