Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0473 |
Symbol | |
ID | 3832411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 475451 |
End bp | 476380 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637828407 |
Product | phosphate binding protein |
Protein accession | YP_429346 |
Protein GI | 83589337 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0226] ABC-type phosphate transport system, periplasmic component |
TIGRFAM ID | [TIGR02136] phosphate binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGAGTA AAGGTAAATG GCTGGCGATC TTAGTGATTG CCCTGGTGGC GGTACTGGCT GTAGCGGGCT GCGGTAAGAA AGAGTTCCCG GCTCCCCAGC AGGGTAGCAG CCAGCAAAAC AGCAACCAGC AGTCCGGCGG GAGCGGCGCC ATAACAGCAG CGGGTTCTAC GGCCCTGCAA CCCCTGGTAG ATGAAGCGGC CAAGCAGTAT ATGGAAAAGA ACCCGGGTGT TCGCATCGTC GTCAACGGCG GCGGCAGCGG TAACGGCCTC TCTCAGGTGT TCCAGGGTGC CGTCCAGATA GGCAACTCCG ATATCTTTGC CGAGGAAAAG GACGGTATCG ACGCCTCCCA ACTGGTGGAT CACAAGGTAG CCGTTGTGGG CATGGCGGCC GTTGTTAATC CTGACGTCAA AGTGGACAAC CTTACCCAGC AGCAGCTTAT TGACATCTTT ACCGGTAAAA TTACCAACTG GAAAGACGTC GGCGGTCCTG ACCAGAAGAT CAACATCGTC AACCGGCCCA AGGGTTCCGG TACCCGGGCG ACCTTTAAAA AATATGCCTT AAACGGTGCG GAAGAAGCCC AGGGTATCGC CATGGAGCAG GATGCCTCCG GTACCGTGCG CAAGACCATT GCCGAAACCC CAGGCGCTAT CGGCTACCTG GCTCTCTCCT ACATTGATTC CAGCGTCCGG CCCCTCAAAA TTGACGGGGT TGAACCTACG GCAGCCAATA TTGTGGACAA CAAGTACAAG GTCTGGGCAT ACGAGCATAT GTACACCAAA GGCCAGCCGA CGGGGGAAGT CAAGAAATTC CTGGATTTCA TAATGAGCGA CGAAGTCCAG AAGAACCTGG TAACCAAGAT GGGCTACATC CCCGTAACCG ACATGAAGGT TGAACGCGAC GCCCAGGGAA ATGTGACTCC CAAGAAGTAG
|
Protein sequence | MLSKGKWLAI LVIALVAVLA VAGCGKKEFP APQQGSSQQN SNQQSGGSGA ITAAGSTALQ PLVDEAAKQY MEKNPGVRIV VNGGGSGNGL SQVFQGAVQI GNSDIFAEEK DGIDASQLVD HKVAVVGMAA VVNPDVKVDN LTQQQLIDIF TGKITNWKDV GGPDQKINIV NRPKGSGTRA TFKKYALNGA EEAQGIAMEQ DASGTVRKTI AETPGAIGYL ALSYIDSSVR PLKIDGVEPT AANIVDNKYK VWAYEHMYTK GQPTGEVKKF LDFIMSDEVQ KNLVTKMGYI PVTDMKVERD AQGNVTPKK
|
| |