Gene Csal_3089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_3089 
Symbol 
ID4028895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp3441953 
End bp3443080 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content67% 
IMG OID637968303 
Productperiplasmic binding protein 
Protein accessionYP_575132 
Protein GI92115204 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCACCT CACTATTCAA AGCCGCCGCC GGCGCAGTCC TCACCCTCGC TGCCACGACC 
GCGCTGGCCG AAGAGATCAC CGTCACTGAC GTCGCCGGCC GCCAGGTCAC CGTCGACGCT
CCCGTGAATC GCCTGATTCT CGGTGAGGGG CGCCAGATCT ATCTGCTCGG CGCGCTGCAG
CCGGAGACCC CCTTCGAGCA CGTGGTGGGC TGGCGCGAGG ACTTCTCGCA GGCCGATCCG
GACAACTACG CCGCCTATGC CGCCAAGTTT CCCGAGATGA AGCAGATCCC CACCTTCGGT
GGCTTCAAGG ACGGCACCTT CGATGTGGAG CAGGCTGCTG CGCTACAGCC CGACGTCGTG
CTGATGAACC TGGAGGCCAA GGCCGCCACC GAGGACGCCG CCTACGACGA CAAGCTGGCC
GAACTGGGCA TCCCGATCGT CTACGTGGAC TTCCGCGAGG CGCCGCTCGA ACACACGACG
CCTTCCATGC GACTGATCGG CCGGCTACTC GGCGAGGAAG AAAGGGCCGA GGCCTTCATC
GACTATTCAC AGGCCCAGAT GGCGCGCGTC GCCGAGACCA TCGAAACTGC CGACCCCCAG
CGTCCCCGGG TCTTCATCGA TCGTGCCGGC GGCTATTCCG ACGACTGCTG CATGAGCTTC
GGCCCGGGCA ACTTCGGTAA ATACGTCGAG CTCGCCGGGG GGAGCAACAT CGCCGACGGC
ATCATTCCCA ACACCTTCGG CCGGCTGAAC CCGGAGCAGA TCATCGCCGC CGACCCGCAA
CAGGTGGTCG TGACCGGCGG CCACTGGGAC GCCTACGTGC CCGGCGGCGA CTGGGTGGGC
GTGGGCCCCG GCGCCGACCT GGCGGCCGCG CGGACCAAGC TCGAAGGGCT CACCGAGCGC
ACCGCCATGG CCGGCATCGA CGCCGTGCAG ACCGACAATT TTCACGCCAT CTGGCACCAG
TTCTACAACA GCCCCTACTA CTTCGTCGCC GTGCAGCGGC TGGCCAAGTG GTTCCACCCC
GAGCTGTTCG CCGACCTCGA CCCCGAGGCG ACGCTGCGGG AGTTGCACGA ACGCTTCCTG
CCGGTGGACT ACGTGCCGGG CTACTGGGTC TCGCTGAAGG GTGACTGA
 
Protein sequence
MLTSLFKAAA GAVLTLAATT ALAEEITVTD VAGRQVTVDA PVNRLILGEG RQIYLLGALQ 
PETPFEHVVG WREDFSQADP DNYAAYAAKF PEMKQIPTFG GFKDGTFDVE QAAALQPDVV
LMNLEAKAAT EDAAYDDKLA ELGIPIVYVD FREAPLEHTT PSMRLIGRLL GEEERAEAFI
DYSQAQMARV AETIETADPQ RPRVFIDRAG GYSDDCCMSF GPGNFGKYVE LAGGSNIADG
IIPNTFGRLN PEQIIAADPQ QVVVTGGHWD AYVPGGDWVG VGPGADLAAA RTKLEGLTER
TAMAGIDAVQ TDNFHAIWHQ FYNSPYYFVA VQRLAKWFHP ELFADLDPEA TLRELHERFL
PVDYVPGYWV SLKGD