Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_4054 |
Symbol | |
ID | 5541565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5260684 |
End bp | 5261997 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640896166 |
Product | major facilitator transporter |
Protein accession | YP_001434104 |
Protein GI | 156743975 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0519542 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAGA TTGCTATGAT GCACACTCCC CGTTCAGGCT ACCGCTGGTT CGTGATTGCA GTCTTTTTTT GCTTCATGCT GCTGCACCAG GCGGATAAAC TGCTGATCGG ACCGCTGACC CCCGCGATTA TGGATGAATT CGGCATCACC ATGACTCAGA TGGGGGCGGT GACGACCGGC GCGCTGGTGG TGGCATCAAT CCTCTATCCC ATCTGGGGCT ACCTGTACGA CCGGTTTGCC CGCGCGCGGT TGCTGGCGCT GGCATCGTTC ATCTGGGGCG CGACGACCTG GTTGAGTGCA ATCGCGCGTA CTTACCCGAC GTTCCTGGCA GCGCGCGCCT CGACCGGCAT TGATGACTCG TCGTACCCCG GCATGTACGC GCTGGTTGCC GATTATTTCG GTCCCAACCT GCGCGGCAAA GTGTATGGGC TGTTGCAGCT GGCGCAGCCA ATCGGCTACC TGATCGGCAT GGTGTTGGCG TTGATGCTGG CGCCGCAGAT CGGATGGCGC ACCATATTTT TTTTCACTGG CGGGTTGGGA ATTGTGGTCG CGCTCGTCAT TTTGTTGGGC GTCCGCGAAA TGCCGCGCGG CAAAGCGGAA CCAGAGTTCG AGGGAATGAC CGAGATGGCG CGCTTCCGTT TCTCGTGGGC GGAGATGCGC GCCGTGCTGG GGAAACGCAC GATGTGGTTC GTCTTTCTCC AGGGATTTGC CGGCGTCTTC CCGTGGAATG TCATCACTTA CTGGTTCTTC ACCTACCTGG CGCGTGAGCG CGGCTACGAC GAAAGCAGCA TTTTGCTGAC CGTTGCGCCC GTCATCCTGA TTCTGGCGAG CGGCAGTTTC ATCGGTGGGG TATTGGGTGA CTGGGCATTC AAACGCACCA CGCGCGGACG GATCATCGTG TCGAGCATTG GCGTGCTCAT GGGAGCGATT TTCCTGTATC TGGCGATGCA AACGCCGGTC GAAGCGCGCA CGACGTTCTT CGTGCTCATG TGCCTGACGG CGCTCTTCAT GCCGCTTTCA TCACCCAATG TCATTGCTAC GGTGTATGAT GTGACGGTGC CGGAGGTGCG CAGTACGGCT CAGGCGGTCG AATATTTCAT CGAGAACAGC GGTGCGGCGC TGGCGCCGCT TCTGGCGGGC ATTATTGCAG ATATGTACAA CCTGCAAACC GCCATTACGT GGATCTGCGT CACTGCCTGG GCGCTCTGCT TTATGTTCTA TCTTGGCGCG TTGCGCTACA TTGAGCGCGA CCACCATGCT CTGCGCGATG AGATGGGGCG TCGCGCAGCA TCCTTCCGGC AGACGATGGC GTAG
|
Protein sequence | MSEIAMMHTP RSGYRWFVIA VFFCFMLLHQ ADKLLIGPLT PAIMDEFGIT MTQMGAVTTG ALVVASILYP IWGYLYDRFA RARLLALASF IWGATTWLSA IARTYPTFLA ARASTGIDDS SYPGMYALVA DYFGPNLRGK VYGLLQLAQP IGYLIGMVLA LMLAPQIGWR TIFFFTGGLG IVVALVILLG VREMPRGKAE PEFEGMTEMA RFRFSWAEMR AVLGKRTMWF VFLQGFAGVF PWNVITYWFF TYLARERGYD ESSILLTVAP VILILASGSF IGGVLGDWAF KRTTRGRIIV SSIGVLMGAI FLYLAMQTPV EARTTFFVLM CLTALFMPLS SPNVIATVYD VTVPEVRSTA QAVEYFIENS GAALAPLLAG IIADMYNLQT AITWICVTAW ALCFMFYLGA LRYIERDHHA LRDEMGRRAA SFRQTMA
|
| |