Gene Smal_3999 details


Gene Information

Locus tag: Smal_3999
Symbol:
ID: 6474893
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stenotrophomonas maltophilia R551-3
Kingdom: Bacteria
Replicon accession: NC_011071
Strand:
Start bp: 4518888
End bp: 4520159
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 68%
IMG OID: 642733212
Product: protein of unknown function DUF323
Protein accession: YP_002030381
Protein GI: 194367771
COG category: [S] Function unknown
COG ID: [COG1262] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03440] conserved hypothetical protein TIGR03440
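
The Start bp, End bp, Gene Length and Protein Length fields in the record above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes one stop codon that is not counted in the protein length. The function name `check_cds_lengths` is hypothetical and used only for illustration.

```python
def check_cds_lengths(start_bp: int, end_bp: int,
                      gene_length_bp: int, protein_length_aa: int) -> bool:
    """Return True if the coordinates and lengths of a CDS agree.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                     # a CDS should be a whole number of codons
        return False
    codons = span // 3
    return codons - 1 == protein_length_aa  # drop the stop codon


# Values copied from the record above.
print(check_cds_lengths(4518888, 4520159, 1272, 423))
```

For this record the span is 4520159 - 4518888 + 1 = 1272 bp, which is 424 codons, or 423 amino acids plus one stop codon.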


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.0859186
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAGCG TACCCGCCGC CGTCGCTGCC CCGCAGCACG ATCTCGCCCG CCAGTTCGCC 
CATGTACGGA CGCGCAGCCT GCAGCTGGCC GCGCCGCTCA GTGCCGAGGA CGCCATGCTG
CAGAGCATGG CTGACGCCAG CCCGAGCAAA TGGCACCTGG CCCACACCAC CTGGTTCTTC
GAACGCTTCG TGCTGGCGGG TTTCGGCACC GCGCCAGCGC ATGACCCGGC CTGGGACTAC
CTGTTCAACA GCTACTACAA GAGCATCGGC CCGGCGCATG CGCGGCCACA GCGCGGGTTG
CTGTCGCGGC CCTCGCTGCA GCAGGTGCGC GACTACCGCA AGCAGGTCGA TGCGCAGGTG
CAATCGCGGT TGGCGGCGGG CGATCTCGAT GAACAGGCAC TGCAGCATCT GCAGCTGGGC
CTGCAGCACG AGCAGCAGCA CCAGGAACTG CTGCTCACCG ACATCAAGCA TGCGTTCTGG
TGCAATCCGC TGCAGCCGCC ATATCGCGAA GACCTGCAAC CCGTCGCAGG CAACGCCAGC
GCTCAGGGCT GGATCGAATC ACCCGAGCGC ATCGTCACCG TCGGCGCCGC GGCATGGCCG
CAGCAGGCCG CGTTCGCCTA TGACAACGAA TCGCCAGCGC ACCGCGTGGT GCTCCCCGCG
CACGCACTGG CCGAGCGTCC CGTCAGCAAT GCCGAGTACC ACGCCTTCAT CGAGGCCGGT
GGTTACCGGG AGCCGCGCTG GTGGCTCAGC GAAGGCTGGG CGCTGCGCGA AGCCGAGGGC
TGGCAGCACC CGCTGTACTG GGATGACGAC CTGCAGCGCG AGTACACCCT CGGCGGTTGG
CGTGCGCTGG ATCCGCACGC GCCGGTCTGC CATCTCAGCT ACTACGAGGC TGACGCCTGC
GCCCGCTGGG CCGGTGCACG CCTGCCCAGC GAGTTCGAAT GGGAAGCGGC TGCGACGTCG
CAACCGGTCA GCGGCCACTT CGCCGAGGAT GACCACCTGC ACCCGTTGGC GGGGCAGGGC
AGTGGACTGC GCCAGCTGTT CGGCGATGTC TGGGAATGGA CCCAATCGGC CTACGGCGCT
TATCCCGGTT TCCGCCCGTT CGCCGGCAAC CTGGGTGAGT ACAACGGCAA GTTCATGTGC
GGGCAGTGGG TGCTGCGCGG CGGCAGCTGC GCCACGCCGC GCGGACATGT GCGCGCCAGC
TACCGAAACT TCTTCATGCC GCCGGCGCGC TGGCAGTTCT CCGGGCTGCG CCTGGCCAGG
GACCTCACAT GA
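
The gene sequence above is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before it can be used. A minimal sketch of that cleanup, together with a GC-fraction calculation that could be compared against the GC content field, is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo input is only the first line of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a sequence printed in 10-base blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGGACAGCG TACCCGCCGC CGTCGCTGCC CCGCAGCACG ATCTCGCCCG CCAGTTCGCC"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Pasting the full gene sequence into `clean_sequence` should give a string of 1272 bases if the record is internally consistent.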
 
Protein sequence
MDSVPAAVAA PQHDLARQFA HVRTRSLQLA APLSAEDAML QSMADASPSK WHLAHTTWFF 
ERFVLAGFGT APAHDPAWDY LFNSYYKSIG PAHARPQRGL LSRPSLQQVR DYRKQVDAQV
QSRLAAGDLD EQALQHLQLG LQHEQQHQEL LLTDIKHAFW CNPLQPPYRE DLQPVAGNAS
AQGWIESPER IVTVGAAAWP QQAAFAYDNE SPAHRVVLPA HALAERPVSN AEYHAFIEAG
GYREPRWWLS EGWALREAEG WQHPLYWDDD LQREYTLGGW RALDPHAPVC HLSYYEADAC
ARWAGARLPS EFEWEAAATS QPVSGHFAED DHLHPLAGQG SGLRQLFGDV WEWTQSAYGA
YPGFRPFAGN LGEYNGKFMC GQWVLRGGSC ATPRGHVRAS YRNFFMPPAR WQFSGLRLAR
DLT
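
The protein sequence above should correspond to the gene sequence translated with translation table 11. The sketch below is one way to check this without external libraries. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons; since this gene starts with ATG, that difference does not matter here. The function name `translate` and the constants are hypothetical names chosen for illustration.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored, so sequences printed in 10-base blocks can be
    passed in directly. Assumes the input contains only A, C, G and T.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGGACAGCG TACCCGCCGC CGTCGCTGCC"))  # MDSVPAAVAA
print(translate("GACCTCACAT GA"))                     # DLT
```

The two fragments reproduce the first block (MDSVPAAVAA) and the final residues (DLT) of the protein sequence listed above.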