Gene Csal_3004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_3004 
Symbol 
ID4028970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp3343454 
End bp3344806 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content64% 
IMG OID637968210 
Productaromatic hydrocarbon degradation membrane protein 
Protein accessionYP_575047 
Protein GI92115119 
COG category[I] Lipid transport and metabolism 
COG ID[COG2067] Long-chain fatty acid transport protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.853487 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAACA ACATCAACAA GCTCACACTC GCCGTGACGC TGGCCTCTAC CGTGATCGCC 
TCCGGCCATG CCGCTGCGTC TGGCTTCCAG GTCCGCGAAC AGAGCGCCAA GGCACTCGGC
AACGCCATGG CCGGGGCGGC GGCCGGTGCC GAGGACGTCA GCTACATGAC CTACAACCCG
GCGGCCATCG GTAACGTCGA GGGCACTCAG GTGGCCGGCG GCATTTCCTA CATCGATGCC
AGCTTCGAAT TGACCGACGC CTCGGCGAGC TCCGTCACCG GGGCCAGTTA CATGGGAAAT
GGCAATAACG AAGGGGGCGA GGAGGCCTGG GTACCCAGTT TTGCCTTCAA GACCCGTCTC
GACGAGCGCT TCGATCTTGG CCTGGCGATC TCCGCGCCGT ACGGTCTCTC GACGAAGTAT
GACAAGGAAT GGATCGGGCG TTATCACGCC ATCGAAACCG AGCTCAAGAC CATCGACATC
CAACCAACGC TGAATTACCG CGCCACCGAC CGCCTCGACC TTGCTGTGGG GCTGCGTGCT
CAGTATGCCG ATGCCACGCT CTCCAACGCC ATCGATCTGG GCGCGGTGGG AGCCGGCGCA
GGTCAGTTGC CGCCCTCGGC GATCGGCAAT GCCGATGGCA TGGCCGAGGT GACCGGTGAC
GACTGGGGCT ACGGCTACAC CCTGGGGGCG CTGTTCCAGG CCACCGAGCG GACCCGTCTG
GGCATCAGCT ACCGCTCGGA GGTCGAGCTG ACGCTGGACG GTGACGTCGA TTACAGCTCC
GACAACGCCG CGGGGCAGGC AGTATTGGCC GGGGCGCAGG CCACCGGTCA GCTGCAGGAC
GGCGGCGGCA AGGCGGAGAT CACCACGCCG GCCAACCTCA ACCTGGGGAT CTACCATCAA
TTGACTGATC GGCTCGCCTT GATGGCCAAT GCCGAATGGA CGGAGTGGAG CAGCTTCGAG
GAGCTGACCG TCGAGTTCGA TAATGGAGGG CAGAGCACCA CGACCGAAAA CTGGGACGAT
ACCTGGGCTT TCTCGGTGGG GGCGAACTAT CAACTCAACC GGCAGTGGCT GCTGCGCGCC
GGCCTGGGGG TCGACGAATC GCCGGTGCCG GACAGCGAAC ACCGCACGCC GCGTGTGCCC
GATGCCGACC GCCGCTGGGC GACGCTCGGG GCCACCTGGA TGCCGACTTC GAACCTGGGC
GTGACGGCGG GCTACATGCA CGTGTTCGGT GACGATGGCG ATATCGATCA GAACGCCACG
CCGACCAACG AGAATGCCAG CCGCGGCAAC CTGTCCGGGA CCTATGAAGT CGAGGCCGAT
GTGTTCGCAC TGTCGATGGA TTATCGCTTC TGA
 
Protein sequence
MHNNINKLTL AVTLASTVIA SGHAAASGFQ VREQSAKALG NAMAGAAAGA EDVSYMTYNP 
AAIGNVEGTQ VAGGISYIDA SFELTDASAS SVTGASYMGN GNNEGGEEAW VPSFAFKTRL
DERFDLGLAI SAPYGLSTKY DKEWIGRYHA IETELKTIDI QPTLNYRATD RLDLAVGLRA
QYADATLSNA IDLGAVGAGA GQLPPSAIGN ADGMAEVTGD DWGYGYTLGA LFQATERTRL
GISYRSEVEL TLDGDVDYSS DNAAGQAVLA GAQATGQLQD GGGKAEITTP ANLNLGIYHQ
LTDRLALMAN AEWTEWSSFE ELTVEFDNGG QSTTTENWDD TWAFSVGANY QLNRQWLLRA
GLGVDESPVP DSEHRTPRVP DADRRWATLG ATWMPTSNLG VTAGYMHVFG DDGDIDQNAT
PTNENASRGN LSGTYEVEAD VFALSMDYRF