Gene Csal_0959 details


Gene Information

Locus tag: Csal_0959
Symbol:
ID: 4026736
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1076922
End bp: 1078823
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 65%
IMG OID: 637966136
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_573015
Protein GI: 92113087
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
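
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 1076922
end_bp = 1078823

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one uninterrupted CDS whose final codon is a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1902 633
```

Under those assumptions the result agrees with the listed Gene Length (1902 bp) and Protein Length (633 aa).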


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACATCCC CTCAGTTTCT CTCCGACACC GCGCACGTCG ATGCCTCGGC CATTCGTCCG 
CTGCCCGGTT CATGCAAGCG ATACGTTCAG GGCTCGCGGC CGGACCTGCG CGTCCCCTTC
CGTGAAATCG CCCTGTCGCC GACCACGACC TCCGGCGCCG CCGAAGAGAA TCCACCGCTG
CTGGTCTACG ATACCTCGGG GCCGTACACC GATCCCGAGT GCGCCATCGA CCTGCGCAAG
GGCCTCCCGG CCCTCAGAAA GGACTGGATC GACGAGCGCG ACGACACTCG ATGGCTCGAC
GGCCCCACCA GTCGCTACGG GCAGCGCCGC GCCAACGACC CGAGGCTCGC ACCGCTGCGC
TTCGACCTGA CCCGCACGCC GCGGCGAGCC AAGGAAGGTC GCAACGTCAC CCAGCTGCAT
TACGCGCGCC AGGGCATCAT CACGCCGGAG ATGGAGTTCA TCGCCATCCG CGAGAATCAG
CGTCGTCAGG CGCTGGGTCC CGAGGAAGTC GAACGCATCC TCGGCCACCA GCACCCGGGC
CAGGGGTTCG GTGCACGCCT TCCGGAGGAG ATCACGCCGG CATTCGTGCG GGATGAAGTC
GCCCGCGGCC GGGCGATCAT CCCCTGCAAC ATCAACCACC CGGAGTCCGA GCCGATGATC
ATCGGCCGCA ACTTCCTGGT GAAGATCAAC GGCAACCTCG GCAATTCGGC GGTGACCTCG
TCCATCGAGG AAGAGGTCGA CAAGATGACC TGGGGCATCC GCTGGGGCGC GGACACGATC
ATGGATCTTT CCACCGGCCA GAACATCCAC GAGACGCGGG AATGGATCAT CCGCAACTCG
CCGGTGCCCA TCGGCACGGT GCCGATCTAT CAGGCGCTGG AGAAGGTCGG CGGCGTGGCC
GAGAACCTGA CCTGGGAGAT CTTCCGCGAC ACCCTGATCG AGCAGGCGGA ACAGGGCGTG
GACTACTTCA CCATCCATGC CGGCGTACGC CTGCATCACG TGCCGATGAC GGCCAGGCGC
GTCACCGGCA TCGTCTCCCG GGGCGGCTCG ATCATGGCCA AGTGGTGCCT GTACCACCAC
CGGGAAAGCT TCCTCTACAC CCACTTCGAG GACATCTGCG AGATCATGAA GGCCTACGAC
GTCGCGTTCT CGCTGGGTGA CGGCCTGCGT CCCGGTTCCA TCGCCGATGC CAACGACGAG
GCCCAGTTCG CCGAGCTGGA AACCCTGGGC GAGTTGACGC AGCTCGCCTG GAAGCACGAC
GTGCAGGTGA TGATCGAGGG CCCCGGCCAT GTCCCGATGC ACCTGATCAA GGAGAACATG
GACAAGCAAC TGGCGTGCTG CGAGGAGGCG CCGTTCTACA CGCTCGGACC GCTGACCACC
GATATCGCCC CCGGCTACGA TCACATCACT TCGGGCATCG GCGCGGCACA GATCGGCTGG
TACGGCTGTG CCATGCTCTG CTACGTCACC CCCAAGGAGC ACCTGGGCCT GCCCAACAAG
GACGACGTCA AGACCGGCAT CATCACCTAC AAGATTGCCG CCCATGCTGC CGACCTCGCC
AAGGGGCACC CCGGCGCGCA ACGCCGCGAC AATGCGCTGT CCAAGGCGCG CTTCGAATTT
CGCTGGGAAG ATCAGTTCAA CCTGGGGCTC GACCCGGATA CGGCACGCGC CTTCCATGAC
GAGACACTGC CCAAGGACTC GGCCAAGGTG GCACACTTCT GCTCGATGTG CGGACCGAAG
TTCTGCTCGA TGAAGATCAG CCAGGAAGTG CGCGACGCCG TCTCCCGGGA GGGGGACTGG
AGCGACCCGC TCGAGAACAC CCAGGAGGCC ATCGAACAAG GCCTGGAGGA ACAGGCCGAA
CGCTTCCGCC GCTCGGGAAA AACGCTCTAT CGGAAGGTGT GA
 
Protein sequence
MTSPQFLSDT AHVDASAIRP LPGSCKRYVQ GSRPDLRVPF REIALSPTTT SGAAEENPPL 
LVYDTSGPYT DPECAIDLRK GLPALRKDWI DERDDTRWLD GPTSRYGQRR ANDPRLAPLR
FDLTRTPRRA KEGRNVTQLH YARQGIITPE MEFIAIRENQ RRQALGPEEV ERILGHQHPG
QGFGARLPEE ITPAFVRDEV ARGRAIIPCN INHPESEPMI IGRNFLVKIN GNLGNSAVTS
SIEEEVDKMT WGIRWGADTI MDLSTGQNIH ETREWIIRNS PVPIGTVPIY QALEKVGGVA
ENLTWEIFRD TLIEQAEQGV DYFTIHAGVR LHHVPMTARR VTGIVSRGGS IMAKWCLYHH
RESFLYTHFE DICEIMKAYD VAFSLGDGLR PGSIADANDE AQFAELETLG ELTQLAWKHD
VQVMIEGPGH VPMHLIKENM DKQLACCEEA PFYTLGPLTT DIAPGYDHIT SGIGAAQIGW
YGCAMLCYVT PKEHLGLPNK DDVKTGIITY KIAAHAADLA KGHPGAQRRD NALSKARFEF
RWEDQFNLGL DPDTARAFHD ETLPKDSAKV AHFCSMCGPK FCSMKISQEV RDAVSREGDW
SDPLENTQEA IEQGLEEQAE RFRRSGKTLY RKV
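
The gene and protein sequences above are laid out in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be cleaned, translated, and checked for GC content. It is not the database's own method. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons (table 11 differs in the start codons it permits).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)

def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence above.
first_blocks = "ATGACATCCC CTCAGTTTCT CTCCGACACC"
print(translate(first_blocks))  # MTSPQFLSDT
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.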