Gene Swit_3040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_3040 
Symbol 
ID5198277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp3333954 
End bp3335990 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content63% 
IMG OID640582589 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_001263528 
Protein GI148555946 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.157189 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAAGC TGCTCGTGCG ATTGTTTCTC CTCACGTGGG TCATGGCGGG ATCGCCGGGC 
CTGTACGCGC GCGGCTTCGA AGCCTCCGAT TCCATGAACA TTGTCGGCAT GAAGCATCCG
CAGATCGCTC CCGATGGAGC GCGCGCGTCG ATCATCGTTT CAAAGGCCGA TATCGAGGCG
AACCGGTTCA CCGCCGAACT ACGCGTCGTC GACACGCGCA CCGGACAAGA AATATTGGCC
CTCGATCCGG CATGGGAGGT GAGTGAAAGC CGCTGGGCGC CTGACGGCCA GCAGCTCGCC
ATCATTGCGA AGCGTGCTGG TGTGAAGGAG GCAGTGGCAC AGATCTACGT GCTGGACATC
AAATCCCCCA CGCCGCGGCA GATCACGCAG GCCTCGGATG GCATCGTTCA GCTTGCCTGG
GGGCCCGACG GCAAGAGCGT CGCCTATGGC CGCGAAGAAG CGCTCGGACT GCCCAAAGTC
GACGGCAAGC TCGTGTCCTT CGAAGTGAAG GAGCGGGGCT ATCTCGCCAC CGAGCCGCGC
AAGCGAATCC AGCTCTGGAT CGCGCAGGCC GATGGTTCGG GTGAAAAGCA ACTAACGCGG
GGGAATTGGA CCCTCCAGGT GCCCTATAAG GGATCGGGTC CGGCAATCCT TCCCTTCGCG
TGGTTCCCTG ACGGCAAGAG TATCGTCGTG GCGACCCAGG CCGAGCCCGA TGTGAACTCG
CTGGAGCGGG CTCTCCGTAT CGTAGACGTC GCAACGGGCG AAGTTCATTC GTTGCTGGAG
CCTGACGCCC GCGCGATCAA TCCGGTCGTT TCGCCCGACG GGCGCGAGGT CGCCTATTGG
GAATATCCCA GATATGGCGA TGCCTTTTCC TTCTCCGTTC GCGTTGTGGA CGTGGCGAGC
CGGACGGTAC GGGCGTCTCC GCCGCAGCCG CTCGATCATA ATCTCAGCCT GGCGCGTTGG
TTTCCGAAGG GCGGGGACAT TCTCGTCGGC GGCGACGACA GCGATCGGAC GAGCCTCTGG
ATACAGCGCA GAGCCGCGCC GCCGATCAAA GTGGACACCG GCAACCTCGA TCTGGCGATG
CATTTTCGCG TTCGCGCGGA CATGTCCGCC ACCGGCGCGA TCGTCTTCGT GGCGAGTTCC
CCGCAAAGTC CTTTCGAACT CTATTTCATG GCTAATGCGA ACGCGCGTCC TCGGCGGCTG
ACCGACTATA ATGCCGCGGC CAGGGCGCTC GAGTTGGGGA GCGCAAATAT GCTCCGTTGG
CGGACTCATG ATGGCTTCGA AGCGAACGGG GTCGTGACCG TGCCCTACGG TTTTCAGGCG
AACCGTCGTT ACCCGCTGGT GATCGTTTCG CACGGTGGGC CGATGCAGGC TTCGACGCTG
AAATGGAACA GCTTCACTCA GGAACTCGCG GCCCGCGGCT GGATCGTGTT CGAGCCCAAT
TACCGGGGCA GCGACAATCT CGGGCGGAAT TATCAGGTAG CCATCGTCGG CGACATGGGG
GCGGGCCCCG CACGCGATGT GATGGCTGGG CTAGGCGAGT TGAAGAGGCA ATTTCCCATC
GATGCGGATC GGATCGCCGT TACGGGCGAG TCCTATGGCG GCTATATGTC GGGGTGGCTG
ATCAGCCATT ATCAGGGCTG GCGTGCGGCC GTTCTCGGCG CTCCCCTGCT TGATATTGCC
GATTTTGCCG ATCTCGCGGA TGTCGGGGAC CTCGCGGGAA GCCGGTTCTC AGGCGCGTCC
CCGTGGGAGC CGGGCGGGCA GGCTGTCAAT CAGAGCCAGT CGCCCATCTA TGCCTCGGCA
GCGGTCAAGA CGCCCACGCT CTTCCTCACC ACGACATTCG ATCATCGCGT GCCCGTGGTG
AGCTCCCACC GGATGTTTCA GGCGCTGCGG CAGAACAAGG TCGAGACGCG CTTCTTCATC
TACCCCGTCG AAGGGCATGG GACGAGCGAT CCCGTCCGGG AGCTCGATTG GAACAAGCGC
TGGATGGACT GGATCGCCGG GCATTTCGAT GGCGCACCGG GCGAGAAGGG AAAGTGA
 
Protein sequence
MTKLLVRLFL LTWVMAGSPG LYARGFEASD SMNIVGMKHP QIAPDGARAS IIVSKADIEA 
NRFTAELRVV DTRTGQEILA LDPAWEVSES RWAPDGQQLA IIAKRAGVKE AVAQIYVLDI
KSPTPRQITQ ASDGIVQLAW GPDGKSVAYG REEALGLPKV DGKLVSFEVK ERGYLATEPR
KRIQLWIAQA DGSGEKQLTR GNWTLQVPYK GSGPAILPFA WFPDGKSIVV ATQAEPDVNS
LERALRIVDV ATGEVHSLLE PDARAINPVV SPDGREVAYW EYPRYGDAFS FSVRVVDVAS
RTVRASPPQP LDHNLSLARW FPKGGDILVG GDDSDRTSLW IQRRAAPPIK VDTGNLDLAM
HFRVRADMSA TGAIVFVASS PQSPFELYFM ANANARPRRL TDYNAAARAL ELGSANMLRW
RTHDGFEANG VVTVPYGFQA NRRYPLVIVS HGGPMQASTL KWNSFTQELA ARGWIVFEPN
YRGSDNLGRN YQVAIVGDMG AGPARDVMAG LGELKRQFPI DADRIAVTGE SYGGYMSGWL
ISHYQGWRAA VLGAPLLDIA DFADLADVGD LAGSRFSGAS PWEPGGQAVN QSQSPIYASA
AVKTPTLFLT TTFDHRVPVV SSHRMFQALR QNKVETRFFI YPVEGHGTSD PVRELDWNKR
WMDWIAGHFD GAPGEKGK