Gene Csal_0729 details

Gene Information

Locus tag: Csal_0729
Symbol:
ID: 4028264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 817355
End bp: 818446
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 69%
IMG OID: 637965899
Product: extracellular solute-binding protein
Protein accession: YP_572789
Protein GI: 92112861
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID:
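
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(817355, 818446)
print(length)                     # 1092
print(protein_length_aa(length))  # 363
```

Both values match the Gene Length and Protein Length fields of the record.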


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.706674
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGATTC GTCGTGTGGC GTTCGCCTGG CTGCTGGGGG CCCTGGCGCC GAGCCTCGCC 
TGGGCCGACA CCCCGCTCAT CGTCGAGGCT GCGCTCGACA GGGAGGTCGC GGCGCCTCTG
CTGGATGCCT TCGAGGCCGC CTATCCGCAC ATCCAGCTGA CCTTCCGCGA CCGCTCGACG
CTCGAGGTCG ACGCCCGAAT CGCCGACACG AACGATCCGC CGGATGTCGT CATCAGCTCC
GCCATGCCCT GGCAGATGGC GCGCGTCAAC GAGGGCTACG CGCGGCGTCT CGACTCCCCG
GCGGCGCGGG AGTGGCCGGC GTGGGCCAAG TGGCGCAACG AGGTCTTCGG CTTCACCTTC
GAGCCCATCG TCATGGCCTA TCGGCTGGAT CTGGCGCGAC ACATGCTGCC GCCGACGACC
CACGCCGACC TGCACACGCT GCTCACCCAG CAACGCGGAA CGCTGCGCGA CAAGGTGACG
ACCTATTCGC CCGTGCGCAG CGGCATCGGC TATACGCTGT TCCAGCAGGA TGCCCGCTAC
ACCACCCGGT TCTGGGACCT GGTGGCCGCA ATGGGGGCGG CCGACGCCAA TCTCGAGGCC
AACACCCGCT CCATGCTCGA GGGGCTCAGC GAAGGGCGCT ACTGGCTGGG CTACAACCTG
CTCGGCTCCT ACGCCATGCA CTGGGCGCAA TCGCATCCCG AGGTCTTCGT GCAGGTGCCG
CAGGACTATT CGCTGGTGAT GATGCGCATG GCGTTCATCC ACCGCGACGC CCCGCACCCC
GCGGCGGCCC GGGCGTTTCT CGACTTCCTG CTGGGCCGCG AAGGGCAACG TGTCATCGCC
GGCGAGACGC CGCTGTTCAG CGTGCGCTCG GATGTGGTGG GCCCCTACAC CGCCCAGCGG
CTGCGCGACC AGGTCGGGGA GCGCCTGTAC CCCATCCCGC TGAATACCTC GCTGCTCGCC
TTCGTCGACC CCTCACGGCG CGCGGCGTTC ATGGATCGCT GGCAGCGGGA ATTCCGTCGC
CTGACGCACC CCGCACCGGC CCCGAGCGCG GCCGCCACCG AACCGCGCAT TGACAAACGC
GCGCAACGTT GA
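
The GC content reported in the gene information can be recomputed from the sequence. The sketch below is a minimal example, assuming the sequence is pasted as shown here (blocks of ten bases separated by spaces and line breaks); `gc_fraction` is an illustrative helper, not a function from any particular library. Only the first two blocks are used in the demonstration; applied to the full gene sequence, the result should be close to the reported figure.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First two 10-base blocks of the gene sequence above.
print(gc_fraction("ATGTCGATTC GTCGTGTGGC"))  # 0.55
```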
 
Protein sequence
MSIRRVAFAW LLGALAPSLA WADTPLIVEA ALDREVAAPL LDAFEAAYPH IQLTFRDRST 
LEVDARIADT NDPPDVVISS AMPWQMARVN EGYARRLDSP AAREWPAWAK WRNEVFGFTF
EPIVMAYRLD LARHMLPPTT HADLHTLLTQ QRGTLRDKVT TYSPVRSGIG YTLFQQDARY
TTRFWDLVAA MGAADANLEA NTRSMLEGLS EGRYWLGYNL LGSYAMHWAQ SHPEVFVQVP
QDYSLVMMRM AFIHRDAPHP AAARAFLDFL LGREGQRVIA GETPLFSVRS DVVGPYTAQR
LRDQVGERLY PIPLNTSLLA FVDPSRRAAF MDRWQREFRR LTHPAPAPSA AATEPRIDKR
AQR
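
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline: it builds the standard codon table, whose codon assignments translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not handle). The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored. Alternative start codons
    (e.g. GTG, TTG) are NOT converted to M; this gene begins with ATG.
    """
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence.
print(translate_cds("ATGTCGATTC GTCGTGTGGC GTTCGCCTGG"))  # MSIRRVAFAW
print(translate_cds("GCGCAACGTT GA"))                     # AQR
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.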