Gene Csal_2100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2100 
Symbol 
ID4029243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2368696 
End bp2369739 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content64% 
IMG OID637967299 
ProductABC transporter related 
Protein accessionYP_574150 
Protein GI92114222 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1135] ABC-type metal ion transport system, ATPase component 
TIGRFAM ID[TIGR02314] D-methionine ABC transporter, ATP-binding protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.182492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAAAC TCGAAGGTGT CTCCAAAACC TATGGCGCCG GCCCCACGGC GGTCCACGCC 
CTCAAAAACA TCGACCTTGA CGTCCCGCAG GGCGCCATTC ACGGCGTCAT CGGCCTTTCG
GGGGCCGGCA AGTCGACGCT GATACGTTGC GTCAATCTGC TCGAGCGTCC CACGTCGGGC
CGCGTCATCG TCGACGGCCA GGACCTGACC CGACAGGATG CCGAGGCATT GCGGCAATCG
CGTCATCAAC TGGGCATGAT CTTCCAGCAC TTCAATCTGC TGGCCTCGCG CACCGTTTTC
GATAACGTGG CCCTGCCTCT GGAGTTGATG GGTGTGTCGA AGAGTGACAT TCGCGAGCGC
GTCGAGCCAC TGCTCGACCT GACCGGGCTG ACCGACAAGG CGCGACAGTA TCCGGCCCAG
CTCTCCGGCG GCCAGAAACA GCGCGTGGCC ATCGCGCGGG CTCTCGCCAG CCGCCCCAAG
GTATTGCTGT GCGACGAGGC GACCTCCGCG CTCGACCCCC AGACCACGGC TTCGATTCTC
GAGCTGCTGC AGGACATCAA CCGCAAGCTG GGCCTGACCA TTCTGCTGAT CACCCACGAA
ATGGAAGTGG TCAAGAGCAT CTGCCATCGC GTCGGCCTGA TCTCCGACGG CGAACTGGTG
GAAGAAGCCG ATGTCGGCGA TTTCTTCACG GCGCCCGCCA CGCGTCTGGG ACGTGATTTC
CTCAACGCCT TCCTCGAGCT CGAGCCGCCC CAGGCCCTGG TCGAACGCCT CGAGGAGACA
GCCGGTCCTC ACACCCACCC TGTCGTGCGA CTGGCATTCT CCGGCGCCAC GGTCGCGACA
CCGCTCATTT CGCGCCTGGC CCGCGACAGC GGCGTCGACG TCAGCATCCT GCAGGCCAAG
GTGGAGTCGA TCCAGGGACG CACGCTCGGC CTGATGATCG CCGAGCTCAT CGGCTCGCCC
GACACGACGT CGCGGGCACT CACGCAACTC GAAGCACACG ATATCAACGT GGAGGTACTC
GGCCATGTCC AGCGCGATGC TTGA
 
Protein sequence
MIKLEGVSKT YGAGPTAVHA LKNIDLDVPQ GAIHGVIGLS GAGKSTLIRC VNLLERPTSG 
RVIVDGQDLT RQDAEALRQS RHQLGMIFQH FNLLASRTVF DNVALPLELM GVSKSDIRER
VEPLLDLTGL TDKARQYPAQ LSGGQKQRVA IARALASRPK VLLCDEATSA LDPQTTASIL
ELLQDINRKL GLTILLITHE MEVVKSICHR VGLISDGELV EEADVGDFFT APATRLGRDF
LNAFLELEPP QALVERLEET AGPHTHPVVR LAFSGATVAT PLISRLARDS GVDVSILQAK
VESIQGRTLG LMIAELIGSP DTTSRALTQL EAHDINVEVL GHVQRDA