Gene Swit_4410 details

Gene Information

Locus tag: Swit_4410
Symbol:
ID: 5200357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingomonas wittichii RW1
Kingdom: Bacteria
Replicon accession: NC_009511
Strand:
Start bp: 4859823
End bp: 4860788
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 68%
IMG OID: 640583962
Product: proline iminopeptidase
Protein accession: YP_001264886
Protein GI: 148557304
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
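
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is the reading under which the numbers agree) and that the coding sequence ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4859823
END_BP = 4860788
GENE_LENGTH_BP = 966
PROTEIN_LENGTH_AA = 321


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Both checks pass for this record: 4860788 - 4859823 + 1 = 966 bp, and 966 / 3 - 1 = 321 aa.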


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.34926
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGCAC TGTCGACCCA GCTATATCCG CCGATCGAGC CCTATGCGAG CGGGATGCTC 
GACGTCGGCG ACGGCCACGG CATCTATTAT GAGCGGGTGG GGACGCCGGG CGCGAAGCCG
GCGGTGTTCC TGCACGGCGG GCCGGGGGCG GGCTGCTCGC CCGATCACCG GCGACTGTTC
GATCCGGCGC GCTATGACCT GCTGCTGTTC GACCAGCGCG GCTGCGGGCG ATCGGCGCCG
CATGCCGAGC TGACCGCCAA CACGACCTGG CACCTCGTCG CCGATATCGA GCGGCTGCGG
GCGATGGCGG GCGTCGAGGC GTGGCTGGTG TTCGGGGGAA GCTGGGGGTC GACGCTGGCG
CTCGCTTATG CGGAGACGCA CCCCGAGCGG GTCAGCGAAC TGGTGCTGCG CGGCGTCTAC
ACGGCGACGC GGGCCGAGAT CCAATGGTAT TACCAATGGG GCGTGTCGCA GATGTTCCCC
GATAAGTGGG AGCGCTTCGT CGCGCCGATC CCCGAGGCCG AGCGCGGCGA CATGGTCGCG
GCCTATAATC GCCGGCTGAC CGGCACCGAC CCCGCCGCGC AGATCGAGGC GGCGAAGGCC
TGGAGCCTGT GGGAGGGCGA GACGATCACG CTGCTGCCGA GCGCGGCGCT GACCGACCAG
CATGGCGACG ACCATTTCGC GATCGCCTTC GCGCGGATCG AGAATCATTA TTTCTTCCAC
GACTGCTGGC TGGAGCCGGA CCAGTTGCTG CGCGACGCCG GGCGGCTGCG CGGCATCCCC
GGCGTGATCG TCCACGGCCG CTACGACATG CCCTGTCCGC TCCATCATGC CTGGGCCCTG
CACAAGGCCT GGCCCGAGGC CGATTTCCAC CTGATCGAGG GGGCCGGCCA TGCCTATTCG
GAGCCGGGCA TATTGGAACA ATTGATAAAG GCGACCGATC GCTTCGCGGG GAGAGGGCAG
GAATGA
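
The GC content reported for this gene is the fraction of G and C bases in the nucleotide sequence. A minimal sketch of that calculation follows; `gc_fraction` is an illustrative helper, and the example input is only the first 60 bases of the gene sequence above, so its value is not expected to equal the whole-gene figure.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of a sequence.

    Spaces and line breaks (as used in the 10-base blocks of the record)
    are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above (60 bases), for illustration only.
first_line = (
    "ATGTCAGCAC TGTCGACCCA GCTATATCCG "
    "CCGATCGAGC CCTATGCGAG CGGGATGCTC"
)
print(f"{gc_fraction(first_line):.0%}")
```

Passing the complete 966-base sequence to the same function gives the whole-gene value to compare with the GC content field.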
 
Protein sequence
MSALSTQLYP PIEPYASGML DVGDGHGIYY ERVGTPGAKP AVFLHGGPGA GCSPDHRRLF 
DPARYDLLLF DQRGCGRSAP HAELTANTTW HLVADIERLR AMAGVEAWLV FGGSWGSTLA
LAYAETHPER VSELVLRGVY TATRAEIQWY YQWGVSQMFP DKWERFVAPI PEAERGDMVA
AYNRRLTGTD PAAQIEAAKA WSLWEGETIT LLPSAALTDQ HGDDHFAIAF ARIENHYFFH
DCWLEPDQLL RDAGRLRGIP GVIVHGRYDM PCPLHHAWAL HKAWPEADFH LIEGAGHAYS
EPGILEQLIK ATDRFAGRGQ E
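
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table by hand rather than relying on a library. Translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in which start codons are permitted; since this gene starts with ATG, that difference does not matter here. The `translate` helper is illustrative, and the two inputs are the first line and the last two lines of the gene sequence above (the latter begins at base 901, so it is in frame).

```python
from itertools import product

# Codon table in TCAG order; codon assignments shared by the standard
# code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate in-frame DNA, stopping at the first stop codon."""
    dna = "".join(b for b in seq.upper() if b in "ACGT")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


gene_head = (
    "ATGTCAGCAC TGTCGACCCA GCTATATCCG "
    "CCGATCGAGC CCTATGCGAG CGGGATGCTC"
)
gene_tail = (
    "GAGCCGGGCA TATTGGAACA ATTGATAAAG "
    "GCGACCGATC GCTTCGCGGG GAGAGGGCAG GAATGA"
)
print(translate(gene_head))  # MSALSTQLYPPIEPYASGML
print(translate(gene_tail))  # EPGILEQLIKATDRFAGRGQE
```

The two outputs match the first 20 and last 21 residues of the protein sequence listed above; the final TGA codon is the stop and is not translated.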