Gene Rcas_4054 details

Gene Information

Locus tag: Rcas_4054
Symbol:
ID: 5541565
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5260684
End bp: 5261997
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 59%
IMG OID: 640896166
Product: major facilitator transporter
Protein accession: YP_001434104
Protein GI: 156743975
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
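
The coordinates and lengths listed above are mutually consistent, which a few lines of Python can check. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5260684, 5261997)
print(length)                     # 1314
print(protein_length_aa(length))  # 437
```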


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0519542
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGAGA TTGCTATGAT GCACACTCCC CGTTCAGGCT ACCGCTGGTT CGTGATTGCA 
GTCTTTTTTT GCTTCATGCT GCTGCACCAG GCGGATAAAC TGCTGATCGG ACCGCTGACC
CCCGCGATTA TGGATGAATT CGGCATCACC ATGACTCAGA TGGGGGCGGT GACGACCGGC
GCGCTGGTGG TGGCATCAAT CCTCTATCCC ATCTGGGGCT ACCTGTACGA CCGGTTTGCC
CGCGCGCGGT TGCTGGCGCT GGCATCGTTC ATCTGGGGCG CGACGACCTG GTTGAGTGCA
ATCGCGCGTA CTTACCCGAC GTTCCTGGCA GCGCGCGCCT CGACCGGCAT TGATGACTCG
TCGTACCCCG GCATGTACGC GCTGGTTGCC GATTATTTCG GTCCCAACCT GCGCGGCAAA
GTGTATGGGC TGTTGCAGCT GGCGCAGCCA ATCGGCTACC TGATCGGCAT GGTGTTGGCG
TTGATGCTGG CGCCGCAGAT CGGATGGCGC ACCATATTTT TTTTCACTGG CGGGTTGGGA
ATTGTGGTCG CGCTCGTCAT TTTGTTGGGC GTCCGCGAAA TGCCGCGCGG CAAAGCGGAA
CCAGAGTTCG AGGGAATGAC CGAGATGGCG CGCTTCCGTT TCTCGTGGGC GGAGATGCGC
GCCGTGCTGG GGAAACGCAC GATGTGGTTC GTCTTTCTCC AGGGATTTGC CGGCGTCTTC
CCGTGGAATG TCATCACTTA CTGGTTCTTC ACCTACCTGG CGCGTGAGCG CGGCTACGAC
GAAAGCAGCA TTTTGCTGAC CGTTGCGCCC GTCATCCTGA TTCTGGCGAG CGGCAGTTTC
ATCGGTGGGG TATTGGGTGA CTGGGCATTC AAACGCACCA CGCGCGGACG GATCATCGTG
TCGAGCATTG GCGTGCTCAT GGGAGCGATT TTCCTGTATC TGGCGATGCA AACGCCGGTC
GAAGCGCGCA CGACGTTCTT CGTGCTCATG TGCCTGACGG CGCTCTTCAT GCCGCTTTCA
TCACCCAATG TCATTGCTAC GGTGTATGAT GTGACGGTGC CGGAGGTGCG CAGTACGGCT
CAGGCGGTCG AATATTTCAT CGAGAACAGC GGTGCGGCGC TGGCGCCGCT TCTGGCGGGC
ATTATTGCAG ATATGTACAA CCTGCAAACC GCCATTACGT GGATCTGCGT CACTGCCTGG
GCGCTCTGCT TTATGTTCTA TCTTGGCGCG TTGCGCTACA TTGAGCGCGA CCACCATGCT
CTGCGCGATG AGATGGGGCG TCGCGCAGCA TCCTTCCGGC AGACGATGGC GTAG
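
GC content is the fraction of G and C bases in a sequence. The sketch below strips the spacing and line breaks used in the 10-base blocks above and computes that fraction. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene, so its value is not expected to match the GC content reported for the whole gene.

```python
def clean_sequence(text):
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence.
print(gc_fraction("ATGTCCGAGA TTGCTATGAT"))  # 0.4
```

Pasting the full gene sequence into `gc_fraction` gives the value for the whole gene.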
 
Protein sequence
MSEIAMMHTP RSGYRWFVIA VFFCFMLLHQ ADKLLIGPLT PAIMDEFGIT MTQMGAVTTG 
ALVVASILYP IWGYLYDRFA RARLLALASF IWGATTWLSA IARTYPTFLA ARASTGIDDS
SYPGMYALVA DYFGPNLRGK VYGLLQLAQP IGYLIGMVLA LMLAPQIGWR TIFFFTGGLG
IVVALVILLG VREMPRGKAE PEFEGMTEMA RFRFSWAEMR AVLGKRTMWF VFLQGFAGVF
PWNVITYWFF TYLARERGYD ESSILLTVAP VILILASGSF IGGVLGDWAF KRTTRGRIIV
SSIGVLMGAI FLYLAMQTPV EARTTFFVLM CLTALFMPLS SPNVIATVYD VTVPEVRSTA
QAVEYFIENS GAALAPLLAG IIADMYNLQT AITWICVTAW ALCFMFYLGA LRYIERDHHA
LRDEMGRRAA SFRQTMA
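
The protein sequence above corresponds to the gene sequence translated codon by codon. As a rough illustration, the sketch below builds a codon table from the standard genetic code, whose amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical, and translation simply stops at the first stop codon.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order; table 11 uses the same assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a block-formatted DNA string until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence (10 codons).
print(translate("ATGTCCGAGA TTGCTATGAT GCACACTCCC"))  # MSEIAMMHTP
```

The printed residues match the first block of the protein sequence listed above.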