Gene Csal_2154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2154 
Symbol 
ID4026494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2423631 
End bp2424815 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content65% 
IMG OID637967359 
Producttetratricopeptide repeat protein 
Protein accessionYP_574204 
Protein GI92114276 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00078802 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGATC CACTGCTGCT TGGCCTGCTG GTCGCGGCGA TTGCCATCGG TTGGTGGCTG 
GGACGGCGCG AGCGCCGGCG CGCTCGCGAA CAACCCGCTC CCGGCGTGTC CCTGCCCCGG
GACTATTTCA TCGGCCTCAA CCACTTGCTC AATGAGCAGC CGGATCGTGC CATCGAGACG
TTCGTGCAAG CGCTCGAGGT CAACAGCGAC ACCATCGAAA CCCATATCGC GCTGGGCAAT
CTGTTCCGCT CTCGCGGCGA AGCCGATCGT GCCGTCAAGA TTCATCAGAA CCTCCTGGCC
CGCCCTACCC TGACCCCCCA TCAGGGCGGC CTGGTGCAGC TGGAACTCTC GCGCGACTTC
CTCCACCTCG GCTTGCTCGA CCGCGCCGAA CGCTTGCTGC AGAGCCTCGT CAAGGACGTC
CCCGATGAAG GCCTTCGCGA TGCCGCCAAG CGCTTGCTGG TCGACCTCCT GCAGCGCGAG
AAGGAATGGC AGCAGGCGCT GGATGTCGCC ACGCCGCAGC TCGTCCGCCA GGACGCGGAC
ATCCAGCGCG CCGCAGCGCA CTGGCTATGC GAACTCGCCC AGCGCGACCG GCACAATGCC
AGCCCCGGCA TGGCACGCAA GCGCCTGCGC CAGGCCCTGT CGATCGACGA AAGCTGCGTA
CGGGCCACTT GGCTGCTGGC CGAGTTGGAA CGGGATACCG GCCACTACAA GGCCGTGATT
CGCCATCTCA AGCGCATCCC CGAGCAGGAC CCGGGCTTTC TGCCGGTCAT CCTCGAGCCC
TTGCATGAGG CCTATCGACG CCTGGAAGAC GAGTCGGGCT GGCGCGCATT TCTCGAGACC
AAGCTGCAGG AAACGCACTT CACCAGCGTC GTGATCATGC TCGCGGCGTC CCGACTCAAT
ACCGAAGGCC AGGACGCGGC CATCGAATTG ATCAGCGAAC AGCTCCAAGC GCACCCCAGC
CTGCGCGGCG TCGATTACCT CATGGACCTG TACCTGATCG GCGAAAATCA CGGCGACCGC
GAACGCCTGC TGCTGCTCAA GCAACATACT CAGAAACTGC TGGCAGCACG CCCCCGTCAT
CGCTGTCATC GCTGTGGCTT CGGCAGCGCA CAACTGCATT GGCAATGCCC GCGCTGCCAG
AGCTGGGGCA CCACCAAGCC GATCACGGGA CTGGAAGGCG AGTAG
 
Protein sequence
MLDPLLLGLL VAAIAIGWWL GRRERRRARE QPAPGVSLPR DYFIGLNHLL NEQPDRAIET 
FVQALEVNSD TIETHIALGN LFRSRGEADR AVKIHQNLLA RPTLTPHQGG LVQLELSRDF
LHLGLLDRAE RLLQSLVKDV PDEGLRDAAK RLLVDLLQRE KEWQQALDVA TPQLVRQDAD
IQRAAAHWLC ELAQRDRHNA SPGMARKRLR QALSIDESCV RATWLLAELE RDTGHYKAVI
RHLKRIPEQD PGFLPVILEP LHEAYRRLED ESGWRAFLET KLQETHFTSV VIMLAASRLN
TEGQDAAIEL ISEQLQAHPS LRGVDYLMDL YLIGENHGDR ERLLLLKQHT QKLLAARPRH
RCHRCGFGSA QLHWQCPRCQ SWGTTKPITG LEGE