Gene Swit_4690 details

Gene Information

Locus tag: Swit_4690
Symbol:
ID: 5196375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingomonas wittichii RW1
Kingdom: Bacteria
Replicon accession: NC_009511
Strand:
Start bp: 5158375
End bp: 5159409
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 74%
IMG OID: 640584244
Product: A/G-specific DNA-adenine glycosylase
Protein accession: YP_001265165
Protein GI: 148557583
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
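
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Number of residues for an unspliced CDS whose length includes one stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(5158375, 5159409)
print(length, protein_length_aa(length))
```

With the listed coordinates this agrees with the Gene Length and Protein Length fields of the record.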


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.036705
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.270203
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGTCG ATGCCGTCCC CGAAAATTTG CTGGCCTGGT ACGATGCCCA TCATCGCCGC 
CTGCCCTGGC GCGCCGCCCC CGGCGAGGCG CCGACCGATC CCTATCGCGT CTGGCTGTCG
GAGATCATGC TGCAGCAGAC CACCGTCGCG GCGGTGAAGC CCTATTTCGA TCGCTTCACG
ACGCGCTGGC CGACCGTCAC CGACCTCGCC CGCGCCGACG AGGGCGAGGT GATGGCCGCC
TGGGCGGGGC TCGGCTATTA TGCCCGCGCC CGCAACCTGA TCGCCTGCGC CCGCGCCGTC
GCCGACGATC ATGGCGGCCG CTTTCCCGAC AGCGAGGCGG GGCTGCGCGC GCTGCCCGGC
ATCGGCGACT ACAGCGCCGC CGCGATCGCC GCGATCGCCT TCGGCCGCCG CGCGGTCGTC
GTCGACGCCA ATGTCGAGCG GGTGGCGAGC CGCCTGTTCG CGTTCGACGA GGCCCTGCCC
AGGGCCCGCC CGGCGCTGCG CGCGCTGGTC GACCGGATCA CTCCCGACGC GCGCGCCGGC
GACTTCGCCC AGGCGATGAT GGACCTCGGC TCCTCGATCT GCACGGTCCG CGCGCCGCAA
TGCCTGCTCT GCCCGCTGAG CGCCGGCTGC GCCGCGCGGA TCGCCGGCAA TCCCGAGGAC
TATCCGGTGA AGGCCGCGAA GAAGGCCAAG CCGCAGCGGC TCGGCACCGC CTTCTGGATC
GAGGACGGCG CGCGCGTCTG GCTGGTGCGG CGGCCCGACA AGGGCATGCT CGGCGGCATG
CGCGCGCTGC CCTCGGGTCC CTGGACCGAC GAGGACCCCG GCCTCGCCGA CGCGCCGGTC
GACGCGCCCT GGCGCGAGGC GGGGTCGGTC GACCATGTCT TCACCCATTT CGCGCTCCGG
CTGCGCGTCG TCACCGCCGT GCAACCGCTC CGCGCCAATG ACGGCGAATG GTGGCCGATC
GACGAGATCG AGCAGGCCGG CCTCCCCACC CTCTTCGCCC GCGCCGCCGC GCGCGCCATC
GCCAGCCGCG CATAG
 
Protein sequence
MSVDAVPENL LAWYDAHHRR LPWRAAPGEA PTDPYRVWLS EIMLQQTTVA AVKPYFDRFT 
TRWPTVTDLA RADEGEVMAA WAGLGYYARA RNLIACARAV ADDHGGRFPD SEAGLRALPG
IGDYSAAAIA AIAFGRRAVV VDANVERVAS RLFAFDEALP RARPALRALV DRITPDARAG
DFAQAMMDLG SSICTVRAPQ CLLCPLSAGC AARIAGNPED YPVKAAKKAK PQRLGTAFWI
EDGARVWLVR RPDKGMLGGM RALPSGPWTD EDPGLADAPV DAPWREAGSV DHVFTHFALR
LRVVTAVQPL RANDGEWWPI DEIEQAGLPT LFARAAARAI ASRA
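
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how they could be checked against each other, the Python below strips the whitespace, computes the GC fraction, and translates the coding sequence. It assumes the standard codon-to-amino-acid assignments, which NCBI translation table 11 shares with the standard code (table 11 differs only in which codons may act as start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard genetic code in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, dropping one trailing stop."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence listed above.
print(translate("ATGTCCGTCG ATGCCGTCCC CGAAAATTTG"))
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and the GC content field. Because the first codon here is ATG, no special handling of alternative start codons is needed in this sketch.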