Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0729 |
Symbol | |
ID | 4028264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 817355 |
End bp | 818446 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637965899 |
Product | extracellular solute-binding protein |
Protein accession | YP_572789 |
Protein GI | 92112861 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.706674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATTC GTCGTGTGGC GTTCGCCTGG CTGCTGGGGG CCCTGGCGCC GAGCCTCGCC TGGGCCGACA CCCCGCTCAT CGTCGAGGCT GCGCTCGACA GGGAGGTCGC GGCGCCTCTG CTGGATGCCT TCGAGGCCGC CTATCCGCAC ATCCAGCTGA CCTTCCGCGA CCGCTCGACG CTCGAGGTCG ACGCCCGAAT CGCCGACACG AACGATCCGC CGGATGTCGT CATCAGCTCC GCCATGCCCT GGCAGATGGC GCGCGTCAAC GAGGGCTACG CGCGGCGTCT CGACTCCCCG GCGGCGCGGG AGTGGCCGGC GTGGGCCAAG TGGCGCAACG AGGTCTTCGG CTTCACCTTC GAGCCCATCG TCATGGCCTA TCGGCTGGAT CTGGCGCGAC ACATGCTGCC GCCGACGACC CACGCCGACC TGCACACGCT GCTCACCCAG CAACGCGGAA CGCTGCGCGA CAAGGTGACG ACCTATTCGC CCGTGCGCAG CGGCATCGGC TATACGCTGT TCCAGCAGGA TGCCCGCTAC ACCACCCGGT TCTGGGACCT GGTGGCCGCA ATGGGGGCGG CCGACGCCAA TCTCGAGGCC AACACCCGCT CCATGCTCGA GGGGCTCAGC GAAGGGCGCT ACTGGCTGGG CTACAACCTG CTCGGCTCCT ACGCCATGCA CTGGGCGCAA TCGCATCCCG AGGTCTTCGT GCAGGTGCCG CAGGACTATT CGCTGGTGAT GATGCGCATG GCGTTCATCC ACCGCGACGC CCCGCACCCC GCGGCGGCCC GGGCGTTTCT CGACTTCCTG CTGGGCCGCG AAGGGCAACG TGTCATCGCC GGCGAGACGC CGCTGTTCAG CGTGCGCTCG GATGTGGTGG GCCCCTACAC CGCCCAGCGG CTGCGCGACC AGGTCGGGGA GCGCCTGTAC CCCATCCCGC TGAATACCTC GCTGCTCGCC TTCGTCGACC CCTCACGGCG CGCGGCGTTC ATGGATCGCT GGCAGCGGGA ATTCCGTCGC CTGACGCACC CCGCACCGGC CCCGAGCGCG GCCGCCACCG AACCGCGCAT TGACAAACGC GCGCAACGTT GA
|
Protein sequence | MSIRRVAFAW LLGALAPSLA WADTPLIVEA ALDREVAAPL LDAFEAAYPH IQLTFRDRST LEVDARIADT NDPPDVVISS AMPWQMARVN EGYARRLDSP AAREWPAWAK WRNEVFGFTF EPIVMAYRLD LARHMLPPTT HADLHTLLTQ QRGTLRDKVT TYSPVRSGIG YTLFQQDARY TTRFWDLVAA MGAADANLEA NTRSMLEGLS EGRYWLGYNL LGSYAMHWAQ SHPEVFVQVP QDYSLVMMRM AFIHRDAPHP AAARAFLDFL LGREGQRVIA GETPLFSVRS DVVGPYTAQR LRDQVGERLY PIPLNTSLLA FVDPSRRAAF MDRWQREFRR LTHPAPAPSA AATEPRIDKR AQR
|
| |