Gene Swit_2534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_2534 
Symbol 
ID5199410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp2822063 
End bp2823343 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content55% 
IMG OID640582089 
ProductHNH endonuclease 
Protein accessionYP_001263031 
Protein GI148555449 
COG category[V] Defense mechanisms 
COG ID[COG3183] Predicted restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCAAAC TTGCCTCTCC CTTTGGGGCA ACCGGATGTT TGAGCCCGAC TCATGTCGTG 
CTACTCTGCG ATATCGGCGC AAGGGGGAAA GGCATGTGGC ATTGGGACCA AGGTCACGTT
GAATATTTCG AGTTCGACAC ACTTCGTGCC ATTTCTGCTT TTGTTCAAAA TCACGACTTC
ATGAGCACTT CGCGGCAGGT CCTGGCAGCA GCTACGGGCC TTCCATTCCC CGCTCCTCCT
ACTCATAGCC CTCAGCGCCA ATACGCCCGC GTTCTGAAAC TTAGTCTTTT GATCAGCGAG
AACGGCGGCG TTGCACAACC TACCCCAGTT TCACTTCTGC TCTCCCAACC AGGCGTCATC
ACATCTGATG AATATTTCCA TTTCCTGGCC CAAGCCTCAA CAGAGCCAAG TCCTGCGCTA
AAGGACTGGT ATCCGGAAGC ACCGTTCCGA TATCCGCTGC TGGTTGCTCT GAAGTATCTT
TTGGCCAAGG CGGCAGTGGG AAGCACACCG TCCGCCACTC TGGACGAGAT TATCGGAGCC
TATCGTGCGA CTGGTTTCGT TGGGTCGGAA GATGACGCGG CCTTTATTGG CTCGCTCGGT
AATGACTCTG CATATACAGT GAAGGGGCTG TCGGCACCCG ACAATCTTCG CCGGCAAGCT
CGAGAGAGCC TTAAGGTTCT CTGTCAGATT TCGTACCTGC ACGTCGTCCG CGATCGGGTG
TTCATCAATC TTGCTCGAAA AGATGCTGAA ATAGCCTTCG GAGAGCTGAA TCCCATCGGC
GGGCCGCGCG CCGCGAATCG CGATGCTGAA ATTAGGCGAT TGGCGAATCT CTTCACGGGA
GGCTCAACGC TCGACTTCTT CGACTATCCG GAGACCGTGA TAGCAGAGGT CGTCGAAAGT
GGCTTCGAGG AGGGGAATAA AGTTCAAAAA ACGCATGTCA CGATCGAGCG AAACAGTGGG
CTGAGGAAGG CATTCTTCGC TGCAAATCCG ACTACCGTTT GCGATGTCTG CAATCTCGAC
ACTGCACGGA GCTATCCGTG GACCGAGAGG GTCATGGACC TCCACCACCT ACTCCCCCTC
AGCTCCGGCA CTCGGGTTAT CGGCAGAGGG ACCACTTTCG ACGACCTCGT GCCGCTGTGC
CCCAGTTGCC ACAGAGCCGT TCATCGCTAC TATGGCGAAT GGTTCCGAAC CACGAAACGC
CTAGACTTTC ACAGCCGGGA TGAGGCGGTC GGGGTCTACA CCAATATGAA ATCGAAGTTC
CCAGGGCTTA TCCATGCATA G
 
Protein sequence
MSKLASPFGA TGCLSPTHVV LLCDIGARGK GMWHWDQGHV EYFEFDTLRA ISAFVQNHDF 
MSTSRQVLAA ATGLPFPAPP THSPQRQYAR VLKLSLLISE NGGVAQPTPV SLLLSQPGVI
TSDEYFHFLA QASTEPSPAL KDWYPEAPFR YPLLVALKYL LAKAAVGSTP SATLDEIIGA
YRATGFVGSE DDAAFIGSLG NDSAYTVKGL SAPDNLRRQA RESLKVLCQI SYLHVVRDRV
FINLARKDAE IAFGELNPIG GPRAANRDAE IRRLANLFTG GSTLDFFDYP ETVIAEVVES
GFEEGNKVQK THVTIERNSG LRKAFFAANP TTVCDVCNLD TARSYPWTER VMDLHHLLPL
SSGTRVIGRG TTFDDLVPLC PSCHRAVHRY YGEWFRTTKR LDFHSRDEAV GVYTNMKSKF
PGLIHA