Gene Swit_3201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_3201 
Symbol 
ID5198205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp3518084 
End bp3519943 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content67% 
IMG OID640582747 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001263686 
Protein GI148556104 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.814938 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGATA TTCCCGCGCG CACCGAGATG AAGGTGACGA CCGGTCCGAT CCGGGGATCG 
CGCAAGATCC ATGTCGAGGG GCCTCAGGGC GTCCGCGTCG CGATGCGCGA GATCGCGCTC
GAGCCGTCGT CGGGCGAGCC GCCGGTCCGC GTCTATGACT GCTCCGGCCC CTATACCGAT
CCCAACGCCC ATATCGACAT CATGGCCGGC CTGCCCGCGC TGCGCCGCGA CTGGATTCTC
GGGCGCGGCG ACGTCGAGGA CTATGAGGGC CGCGCGATCA AGCCCGAGGA CAACGGCCTG
AAGGGGCCGG ACCGGTCGGG CGGCGTCACC CCCTTCCCGA ACGTGGTGCG GCGCCCGCTG
CGCGCGAAGG CAGGGCAGAA TGTCAGCCAG ATGCACTATG CCCGGCGCGG CATTATCACG
CCCGAGATGG AATATGTCGC GATCCGCGAG AATGTCGGCC GCGCCGCGCT CAAGGAGAAG
CTGCTGCGCG ACGGCGAGGA TTTCGGCGCG GCGATCCCCG ACTTCGTCAC CCCCGAATTC
GTCCGCGACG AGGTGGCGCG CGGCCGGGCG ATCATCCCCA GCAACATCAA CCATCCCGAA
TCGGAGCCGA TGGCGATCGG CCGCAACTTC CTGGTGAAGA TCAACGCCAA TATCGGCAAT
TCGGCGGTCG CTTCCGACGT CGCGGCCGAG GTCGACAAGA TGGTCTGGTC GATCCGCTGG
GGCGCCGACA CGGTGATGGA CCTGTCGACC GGCCGCAACA TCCACGACAC GCGCGAATGG
ATCCTGCGCA ACTCGCCGGT CCCGATCGGC ACGGTGCCGA TCTACCAGGC GCTGGAGAAG
GTCGGCGGCA TCGCCGAGGA CCTGACCTGG GAAATCTTCC GCGACACGCT GATCGAGCAG
GCCGAGCAGG GCGTCGACTA TTTCACCATC CATGCCGGCG TCCGCCTGCC CTTCATCCCA
ATGACGGCCA AGCGCGTCAC CGGCATCGTC AGCCGGGGCG GATCGATCAT GGCGAAATGG
TGCCTGGCGC ACCATCGCGA GAGCTTCCTC TACGAGCGGT TCGACGAGAT CTGCGAGATC
ATGAAGGCCT ATGACGTCGC CTTCTCGCTG GGCGACGGCC TGCGCCCCGG ATCGATCGCC
GACGCCAATG ACGAGGCGCA GTTCAGCGAA TTGAAGACGC TGGGCGAGCT GACCCAGATC
GCCTGGCAGC ACGACGTGCA GGTGATGATC GAGGGCCCCG GCCATGTGCC GATGCACAAG
ATCAAGGCGA ACATGGACAA GCAGCTCGAA GCCTGCGGCG AGGCGCCCTT CTACACGCTC
GGGCCGCTGA CCACCGATAT CGCGCCGGGC TATGATCACA TCACCTCGGC GATCGGCGCG
GCGATGATCG GCTGGTTCGG CACGGCGATG CTCTGCTACG TCACGCCGAA GGAGCATCTG
GGCCTGCCCG ACCGCGACGA CGTGAAGGTC GGCGTGGTCA CCTACAAGCT CGCCGCCCAC
GCCGCCGACC TGGCGAAGGG CCACCCGGCC GCCAAGCTGC GCGACGACGC GCTCAGCCGC
GCCCGCTTCG AGTTCCGCTG GCGCGACCAG TTCAACCTGT CGCTCGATCC CGACACGGCC
GAGCAGTATC ACGACCAGAC GCTGCCGGCC GAAGGCGCCA AGACCGCGCA TTTCTGCTCG
ATGTGCGGGC CGAAATTCTG CTCGATGAAG ATCACCCAGG AGGTCCGCGA CTTCGCCGCC
ACCAGGAACG CCCCTGCCGA CCAGTTCATC GCGGCCGGCG ACGCCGAGGC GGGCATGCGC
GCGATGAGCG ACGTGTTCCG CGAGAAGGGC GGGGAGATCT ATCTGCCGGC GGCGGAGTAG
 
Protein sequence
MADIPARTEM KVTTGPIRGS RKIHVEGPQG VRVAMREIAL EPSSGEPPVR VYDCSGPYTD 
PNAHIDIMAG LPALRRDWIL GRGDVEDYEG RAIKPEDNGL KGPDRSGGVT PFPNVVRRPL
RAKAGQNVSQ MHYARRGIIT PEMEYVAIRE NVGRAALKEK LLRDGEDFGA AIPDFVTPEF
VRDEVARGRA IIPSNINHPE SEPMAIGRNF LVKINANIGN SAVASDVAAE VDKMVWSIRW
GADTVMDLST GRNIHDTREW ILRNSPVPIG TVPIYQALEK VGGIAEDLTW EIFRDTLIEQ
AEQGVDYFTI HAGVRLPFIP MTAKRVTGIV SRGGSIMAKW CLAHHRESFL YERFDEICEI
MKAYDVAFSL GDGLRPGSIA DANDEAQFSE LKTLGELTQI AWQHDVQVMI EGPGHVPMHK
IKANMDKQLE ACGEAPFYTL GPLTTDIAPG YDHITSAIGA AMIGWFGTAM LCYVTPKEHL
GLPDRDDVKV GVVTYKLAAH AADLAKGHPA AKLRDDALSR ARFEFRWRDQ FNLSLDPDTA
EQYHDQTLPA EGAKTAHFCS MCGPKFCSMK ITQEVRDFAA TRNAPADQFI AAGDAEAGMR
AMSDVFREKG GEIYLPAAE