Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2011 |
Symbol | |
ID | 5055927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1800001 |
End bp | 1800849 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640469560 |
Product | extracellular solute-binding protein |
Protein accession | YP_001154210 |
Protein GI | 145592208 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.986691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTCAA CAATACTTGC CGTCGTTGCG TTAGTGGTTG GGCTGATCGC CGGATATCTG GTAGGGATGT ACTCAGCGCC TAAGCCAACG GCACCGGCCA CTCAGACCTC CACGCCAGCC GCCGCTCAGT GCCCCGTCTC CGTAGACATA ATAAAGAAGA GGGGTAAGCT AATCCTCGGC ACAGACGCCA CTTGGCCACC CTGGGAATGG GTTGCAAATA ACACATTCGT TGGCTGGGAT ATAGACATCG CCCGCGAAAT TGCCAAGGCC CTGGGCGTAG AGCTGGAGAT ACGCGACATG AGGTTTGCCG GGCTTCTAGA AGCTGTTAGA AAGGGAGATG TGGATCTCGC TCTGAGCGCA ATCACATGGA CTACGGAGAG GGAGAAGGTG CTTGAGTTCT CCATGCCGTA TTACCTAGAA TCAATAGTCA TAGTTACTAA GACAAGCCGC AACGATATCA ATAAGGTGGA GGATCTCTAC GGCAAGAAAG TCGGCGTACA GATCGGCACC ACCCACGACG AGTGGTCCAC TACCAACCTT GAAAAGCCCG GCAAGGCTTC AGTAAGTAGA TACGATAAGG TATACCCATA TATGGTGGAG GTATTGAGGA GGGGGGATGT GGACGCCATT ATCTTGGACA GGTCTATCGC CACGGCGCTG GTGAGGAAGT TCCCCGACTT GAAGATAGCC TTTGAGCTAC CCGGCTCTGC CGGCTATATA TCAGTCGCCA TGCCTAAGTG TGCCCAGGAC CTGAAACTTG TAGTAGACCA GGTGATCGAG AACTTGATTC AGACAGGGAA ACTCGACGAG ATAATGCAGA AGAACTTCGA GCTGTTCCTC AAGTCGTAA
|
Protein sequence | MKSTILAVVA LVVGLIAGYL VGMYSAPKPT APATQTSTPA AAQCPVSVDI IKKRGKLILG TDATWPPWEW VANNTFVGWD IDIAREIAKA LGVELEIRDM RFAGLLEAVR KGDVDLALSA ITWTTEREKV LEFSMPYYLE SIVIVTKTSR NDINKVEDLY GKKVGVQIGT THDEWSTTNL EKPGKASVSR YDKVYPYMVE VLRRGDVDAI ILDRSIATAL VRKFPDLKIA FELPGSAGYI SVAMPKCAQD LKLVVDQVIE NLIQTGKLDE IMQKNFELFL KS
|
| |