Gene Csal_1747 details

Gene Information

Locus tag: Csal_1747
Symbol:
ID: 4028274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1990520
End bp: 1991584
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 62%
IMG OID: 637966935
Product: TRAP dicarboxylate transporter, DctP subunit
Protein accession: YP_573798
Protein GI: 92113870
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
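
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are both inclusive and that the protein length excludes the terminal stop codon; the record itself does not state either convention. The function names are hypothetical and used only for illustration.

```python
# Values copied from the gene record.
START_BP = 1990520
END_BP = 1991584
GENE_LENGTH_BP = 1065
PROTEIN_LENGTH_AA = 354


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming both coordinates are inclusive."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1991584 - 1990520 + 1 gives 1065 bp, and 1065 / 3 - 1 gives 354 aa, which agrees with the listed values.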


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACAA CCACAACGAC ACGCTTCGTT TCCGTCGTGG GCGGCGCCGT TCTCGGCTTG 
ACCGCGTTCA GCGCCTCTGC GGCGACCGAA ATGCCTCCGC TGCCGGAGGT CAAAGGTGAG
AAGGTCGAGG CCGACGGCAA TTTCAAGATC AAGTTCAGCA TCGGCACCAC GGAATCCGGC
GCGCAATACC GCGGCCTCGA ATATTTCGAG AAGATCGTCG AACAACGCAG CGACGGCAAC
ATTCAGGTCG AACTGTTCCC GGGCGCCCAG TTGGGCGATG ACCGCCAGGC CACCAGTGCG
CTCCAGTCGG GCACGCTGGA AATGACCATG CCGTCGACGT CTCCGCTGGT GAACATGTTC
CCGGAATTCG CGGTGTTCGA CCTCCCCTTC CTCTTCCCGC AGCCTGAAAT GGCGGATGCG
GTACTCGACG GCGAGATCGG CCAGCAGATG CTCGAAGACG CGTCCTCGCA AGGCCTGGTG
GCGATCGGCT GGGGTGAAAA CGGTTACCGT CAGCTGACCA ACAGCCAGCA CCCGGTCGAG
GAGCCGGCGG ACCTCGACGG CCTGAAGATC CGTACCATGG AGAACGATCT CCACCTGGAT
ATCTGGCGCA CCCTGGGGGC CAACCCGACG CCGATGTCCT TCGCGGAGCT GTTCACGGCG
CTCGAGCAAG GGGTCGTCGA CGGTCAGGAA AATCCGTGGA TCACCATCGA ATCCTCCAAG
TTCAACGAGG TGCAGGACTA CGCCACCGAA ACCAACCACG TCTACACACC GTTCATCACG
CTGGTCTCCG CGCGTTTCTG GGATCGTCTG CCGGAAGACT ACCAGCAGCT GCTGCGCGAC
GCGGCCACCG AGATGGGCGA CTATCAGCGC CACGTCAGCC GCACGCTGAA CGATCAGATC
AAGCAGGATC TGAAGGATTC CGGCATGCAG ATCACCGAGC TGACGCCGGA GCAGGTCAAG
GTCTTCCAGG ACAAGCTGGA GCCGGTGTAT GAAGACTGGC GCGACCAGAT CGGCGGCGAG
CTGATCGACG ATATCCGCGC CCAGGTGGAA CAAGCGCAAG AGTAA
 
Protein sequence
MTTTTTTRFV SVVGGAVLGL TAFSASAATE MPPLPEVKGE KVEADGNFKI KFSIGTTESG 
AQYRGLEYFE KIVEQRSDGN IQVELFPGAQ LGDDRQATSA LQSGTLEMTM PSTSPLVNMF
PEFAVFDLPF LFPQPEMADA VLDGEIGQQM LEDASSQGLV AIGWGENGYR QLTNSQHPVE
EPADLDGLKI RTMENDLHLD IWRTLGANPT PMSFAELFTA LEQGVVDGQE NPWITIESSK
FNEVQDYATE TNHVYTPFIT LVSARFWDRL PEDYQQLLRD AATEMGDYQR HVSRTLNDQI
KQDLKDSGMQ ITELTPEQVK VFQDKLEPVY EDWRDQIGGE LIDDIRAQVE QAQE
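
The sequences above are printed in space-separated blocks of 10, so the spacing has to be removed before any computation. The following is a minimal sketch, not the database's own method, for cleaning a block-formatted sequence, computing its GC fraction, and translating it. It assumes unambiguous bases (A, C, G, T only) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 for internal codons; the alternative start codons of table 11 are not handled. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N, which this sketch does not handle.
    """
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_line = "ATGACAACAA CCACAACGAC ACGCTTCGTT TCCGTCGTGG GCGGCGCCGT TCTCGGCTTG"
    print(translate(clean_sequence(first_line)))  # MTTTTTTRFVSVVGGAVLGL
```

The translated first line matches the first 20 residues of the protein sequence listed above. Passing the full cleaned gene sequence to gc_content would give a value to compare against the GC content field.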