Gene Csal_0020 details

Gene Information

Locus tag: Csal_0020
Symbol:
ID: 4027339
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 24625
End bp: 26139
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 64%
IMG OID: 637965172
Product: L-threonine ammonia-lyase
Protein accession: YP_572084
Protein GI: 92112156
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01124] threonine ammonia-lyase, biosynthetic, long form
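
The coordinate and length fields above are consistent with one another. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Values copied from the record; the 1-based inclusive convention is an assumption.
start_bp = 24625
end_bp = 26139

gene_length = end_bp - start_bp + 1   # both endpoints are counted
codons = gene_length // 3             # total codons, stop codon included
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # 1515 504
```

Both results match the Gene Length and Protein Length fields listed in the record.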


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.381683
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTAGAAG AGACCGTCAA GAAAATCCTC CAGGCTCGCG TTTACGAAGC GGCCCGGGAA 
ACGCCGATAT CCCCTGCTCC CTTTCTCTCC CGCCGTCTCA ACAACACGAT TCTGATCAAG
CGCGAGGATT TGCAGCCGGT CTATTCCTTC AAGATTCGCG GCGCCTACAA CAAGATGGCC
CAGCTGAGCG ATGAACAGAA GGCCAAGGGC GTGATCGCCG CGTCCGCCGG CAACCATGCC
CAGGGCCTGG CCATGGCCGC CAAGCAGATG GGCGTCAAGG CGGTCATCGT GATGCCGCGC
ATCACGCCCG ACATCAAGGT CCAGGCCGTG CGCGCGCGTG GCGCCAAGGT CGTGCTCAAG
GGCGATGCCT TCGGCGAGGC GCTGGCACAT GCGCGCGAGC TGATCGACGA GCATGGCTAC
ACCTACATTC CGCCCTTCGA CGATAACGAC GTGATCGCCG GCCAGGGCAC GGTGGGCATG
GAGATCCTGC GTCAGCACAG CGGACCGCTG GACGCGGTAT TCGTGCCCGT GGGCGGTGGC
GGCCTGCTCG CCGGCGTGGT GGCGTACATC AAGTACCTGC GCCCCGAGAT CAAGGTGTAC
GGGGTCGAGG CGGAAGACGC TGCCTGCCTC AAGGCGGCCC TGGAAGCCGG CGAACGGGTC
ACCCTCGACC AGGTCGGCGT GTTCGCCGAG GGCGTCGCCG TGGCGCAGAT CGGCGAAGCG
CCGTTCGAGA TCCTGCGCCA CTGGGTGGAT GGCGTGATCA CCGTCACCAC CGATGAGATG
TGCGCGGCGG TCAAGGACAT CTTCGAGGAT ACGCGGGCGG TCGCCGAGAC CTCCGGCGCG
CTGTCGCTGG CGGGGCTCAA GAAATACATC CAGCAGCAGA ACGCCGAGGG CGAGACCCTG
CTGTGCATCA ACTCGGGCGC CAACACCAAT TTCGATCGTC TGCAGCACAT CGCCGAGCGC
ACGGAGCTGG GCGAGCAGCG CGAGGCGATT CTGGCGGTGA CGATTCCCGA ACGGCCGGGC
AGTTTCAAGA AATTCTGCAA GACCATCGGC AAGCGCATGG TCACCGAGTT CAATTACCGC
TATGCCGACC CCGACCACGC GCACATCTTC GTCGGCGTGC AGGTCAAGCC GGGCGGCGAG
GACCGCCAGG CGGTGATCGA CAAGTTGCGC GAGGCCGGTT ATCCGGTGGA GGACCTCACC
GACAACGAAC TGGCCAAGCT GCATATTCGC CATCTCGGTG GCGGGCGTCC CAAGGAGCAC
TTCAGCGAAG AAGTCTACCG GTTCGAGTTC CCCGAACGCC CCGGGGCGCT GATGAACTTC
CTGACTCATC TGCCCGGCGA CTGGAACATT TCACTGTTCC ACTACCGCAA CCATGGCGCG
GCGTATGGCC GAGTGCTGGT GGGCATGCAG ATCCCCAATG GCGCCCGGGC GCATGTCGAG
GAACATTTCG AACGCATCGG CTATCGCTAC TGGAAGGAAT CCGACAATCC CGCCTATCGT
CTGTTCATGG CCTGA
 
Protein sequence
MLEETVKKIL QARVYEAARE TPISPAPFLS RRLNNTILIK REDLQPVYSF KIRGAYNKMA 
QLSDEQKAKG VIAASAGNHA QGLAMAAKQM GVKAVIVMPR ITPDIKVQAV RARGAKVVLK
GDAFGEALAH ARELIDEHGY TYIPPFDDND VIAGQGTVGM EILRQHSGPL DAVFVPVGGG
GLLAGVVAYI KYLRPEIKVY GVEAEDAACL KAALEAGERV TLDQVGVFAE GVAVAQIGEA
PFEILRHWVD GVITVTTDEM CAAVKDIFED TRAVAETSGA LSLAGLKKYI QQQNAEGETL
LCINSGANTN FDRLQHIAER TELGEQREAI LAVTIPERPG SFKKFCKTIG KRMVTEFNYR
YADPDHAHIF VGVQVKPGGE DRQAVIDKLR EAGYPVEDLT DNELAKLHIR HLGGGRPKEH
FSEEVYRFEF PERPGALMNF LTHLPGDWNI SLFHYRNHGA AYGRVLVGMQ IPNGARAHVE
EHFERIGYRY WKESDNPAYR LFMA
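
The gene and protein listings can be cross-checked programmatically. The following is a minimal sketch, not the method used to produce this record: it strips the display spacing, computes GC percentage, and translates codon by codon. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. Translation table 11 shares its amino-acid assignments with the standard code, so the standard assignments are used here. The sketch does not handle alternative start codons, which is not an issue for this gene because it begins with ATG.

```python
from itertools import product

# Amino-acid assignments in NCBI order (each base position cycles T, C, A, G).
# Assumption: table 11 differs from the standard code only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGCTAGAAG AGACCGTCAA GAAAATCCTC CAGGCTCGCG TTTACGAAGC GGCCCGGGAA"
print(translate(first_line))  # MLEETVKKILQARVYEAARE
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein listing and the GC content field.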