Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pcal_1721 |
Symbol | |
ID | 4909676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum calidifontis JCM 11548 |
Kingdom | Archaea |
Replicon accession | NC_009073 |
Strand | + |
Start bp | 1604660 |
End bp | 1605958 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640125468 |
Product | general substrate transporter |
Protein accession | YP_001056604 |
Protein GI | 126460326 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.675628 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAAAA AGGGTAGAGA GGCTGTGGTT ATGTCTCTGC GGTTTATAAC AGCGGCGTCT ACCATCGGCA CGTTGATTGA GTGGTACGAC TTTTTTGCAT ACGCATCAGT GTCGCCCTAC ATATCCGCTA AGTTTTTCCC AAAGGGGGAT CCAATAGTCG CTTCGCTTCT CACGTGGCTC ATATTCGCCA CGGCCTATGT CGTCAGGCCG CTTGGCGCCG TCTTGTTCGG CCACCTGGGC GACAGAGTGG GGAGGAAGGC CACCTTCCTC GCCACATTGA CGCTTATGGG GCTCTCCACG TTTGCCATAG GCCTTTTGCC CACCTACGAA CAGGTCGGGG TACTGGCGCC CTTCCTCCTG GCGCTTTTCA GGATACTCCA AGGCATTTCG CTGGGCGGCG AGTACGGCGG CGCGTTGACG TACGTGTTAG AACACGCCCC TCCCAACCGC AGGGCCTTCT ACGCGGGATT TCTCTCCGCC ACGCCTCCAG CCGGCCTCGC GCTGTCGTCG CTTACTCTTG TCTCAACCTC GCTCCTTCTG TCGCGGGCTG ACATAGAGGC ATGGGGGTGG CGCGTCCCCT TCCTGTTCTC AATTGTGCTT ACTATACTCG GCGTGTATCT AAGGCTTAGG CTAACCGAGA CTCCGCTGTT TGAGGCGGTA AAGAGGGAGG GAAAGGTGGC CAAAGTGCCA GTGGTAGAGG CCTTTACCAA GTACTCAAAG TGGATTGTAG TAGGCGTCGT GATAAGCGCC GGGCACGCGG TGCTGGCTTA CACCTCCACT GGCTACATAT TCACATACTT GACCCAGGTG GCGAAGTTAG ATCCCTTAAC CGTTAACCTT GTCGTCGGCG TGGCGGCGCT CCTCCAGTAC CCATTCTACA TAATAAACGC GTGGCTGGCC GATAAGTATG GGAGGAAGAG GATTTACATG GCGGGGCTGG CACTGGGGCT TCTCACGTAT TACCCAATCT ACCAGTGGCT GGGAACGGCG AGGGGATTGG CAGAGACTAT ATTCGCCGTG TTCATCTTGA TATACGCAAC CGCGTTTACC TTCAGCGTCT TGGGCACGGC CATAGCGGAG CTGTTCCCCA CTAGGGTGAG GTACACAGGA ATGTCTCTGA CATTTAATAT AGGAGTGGGA GTCTTCGGGG GCTTTACTCC CACCGTTGTG CAACTCATAG GCATATCGCT ACAGAACCCG CTGGCCGGGC TGATACTGTA CACCTACGTC GTGGCGGCGC TGGCGCTGGT CGTGGCCTAT TTCTATCTGC CAGAGACTGC CGCCAAGAGG CTGGAGTAG
|
Protein sequence | MFKKGREAVV MSLRFITAAS TIGTLIEWYD FFAYASVSPY ISAKFFPKGD PIVASLLTWL IFATAYVVRP LGAVLFGHLG DRVGRKATFL ATLTLMGLST FAIGLLPTYE QVGVLAPFLL ALFRILQGIS LGGEYGGALT YVLEHAPPNR RAFYAGFLSA TPPAGLALSS LTLVSTSLLL SRADIEAWGW RVPFLFSIVL TILGVYLRLR LTETPLFEAV KREGKVAKVP VVEAFTKYSK WIVVGVVISA GHAVLAYTST GYIFTYLTQV AKLDPLTVNL VVGVAALLQY PFYIINAWLA DKYGRKRIYM AGLALGLLTY YPIYQWLGTA RGLAETIFAV FILIYATAFT FSVLGTAIAE LFPTRVRYTG MSLTFNIGVG VFGGFTPTVV QLIGISLQNP LAGLILYTYV VAALALVVAY FYLPETAAKR LE
|
| |