Gene Rcas_3894 details

Gene Information

Locus tag: Rcas_3894
Symbol:
ID: 5541400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5098013
End bp: 5099143
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 59%
IMG OID: 640896005
Product: permease
Protein accession: YP_001433948
Protein GI: 156743819
COG category: [R] General function prediction only
COG ID: [COG0701] Predicted permeases
TIGRFAM ID:
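
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5098013
END_BP = 5099143
GENE_LENGTH_BP = 1131
PROTEIN_LENGTH_AA = 376


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5099143 - 5098013 + 1 gives 1131 bp, and 1131 / 3 - 1 gives 376 aa, which agrees with the listed gene and protein lengths.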


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.932895
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTAGCC GCACCGTTCC GCCTATGTCA CGCGCCTCGT TTGAGCGCAT TGTGCCGGTT 
GTCTCGCTGG TTGTGCTTGC CTTGATCATC GGTCAGTCGG TGGTGCAAAG CGTCAGCGCC
TGGCTTTCCG GCGCAGAAGC GAAGCATATC GTCCCGGTCT TCGAGGCGCC AATCGCCAGT
GCAGGCAAAG CATTGGGCGC CGCTCTCACC TCCTGGATGC CCGCCTGGAG CATTCCCACA
CCGTTTGGTC CACTCGATGT CAAGCATACG GCGTCGTACA CGATCTATGA ATGGTTCAAA
CTGCCAATCA TTCTCTTTCT GACGACCTAC GGCATGACAC TGTTGCGCCT GAGCATCAGC
ACCCGATGGA TTGAACGCTC CATCGGGCGG AATGATCTGC TGGGGGCTTC GGGTGGTGCG
CTGCTGGGCA TATTCACACC GGTCTGCTCC TGCACCGTCA CCAACATCTA CGCCGGCATC
GTTGCCGGCG GCGCAAGCCA GCGCGCCTCG TCGGCGTTTC TGTTTGCCAG CCCGGCGTTG
AATGAGTTCG CCATTCTTTT TATGTTTGTG ATCGTCGGAC CGTTCGGCGG GCTGGTCTAT
GTGCTGGCAG GTTTTGCCGC CGCGCTGGCG ACCGCGTATC TTGCGCCGGT TCTGGGGCTG
GATCCGGCGC GTTTTGTGCA GCAGGTCGTT TCACCGCACT TATGCGGTAC GATTGCCCGG
GAGAGCATTC TGGTTCGTGC GCACCGCGAG GCGTGGGCAT TGTTTAAGCG ATTGTTTGGC
GTTGTCCTGT TCAGTGGGTT GCTGGCAGGC ATTCTGGTCA ATTTTAACCT GACGCTGGTA
GAGAGTCTGA AACAGGCGGG CGCTGCGTGG TGGGGACCGC TCATCGCCAC CGTGCTCGGT
CTGCCGCTCG ACATTAATGC CGCTTCAACG GCGCCGATTC TGGTGGCATT GCACCAGATT
GTGCCGATTG GAACACTGGT GGCGGCAATG ATGGCAACGA CCGTCTCCTC AATCCCGGAG
TGGGCGATGC TCAATCGTCT GATTGGAAAG GCGGGAGCGA TCAAAGTCGT GCTCTGGTAT
GCAACCTATG TGGCGCTCCT GGGGTTGCTG CTCAACTGGT TGTTTGCCTG A
 
Protein sequence
MFSRTVPPMS RASFERIVPV VSLVVLALII GQSVVQSVSA WLSGAEAKHI VPVFEAPIAS 
AGKALGAALT SWMPAWSIPT PFGPLDVKHT ASYTIYEWFK LPIILFLTTY GMTLLRLSIS
TRWIERSIGR NDLLGASGGA LLGIFTPVCS CTVTNIYAGI VAGGASQRAS SAFLFASPAL
NEFAILFMFV IVGPFGGLVY VLAGFAAALA TAYLAPVLGL DPARFVQQVV SPHLCGTIAR
ESILVRAHRE AWALFKRLFG VVLFSGLLAG ILVNFNLTLV ESLKQAGAAW WGPLIATVLG
LPLDINAAST APILVALHQI VPIGTLVAAM MATTVSSIPE WAMLNRLIGK AGAIKVVLWY
ATYVALLGLL LNWLFA
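
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is given in coding orientation, as listed above, and uses the amino acid assignments of translation table 11, which match the standard code. Alternative start codons such as GTG or TTG are not given special handling here; this gene starts with ATG, so that does not matter for this record. The helper names `clean`, `translate`, and `gc_content` are illustrative only.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; table 11 uses the same
# amino acid assignments as the standard code ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last display lines of the gene sequence above.
FIRST_LINE = "ATGTTTAGCC GCACCGTTCC GCCTATGTCA CGCGCCTCGT TTGAGCGCAT TGTGCCGGTT"
LAST_LINE = "GCAACCTATG TGGCGCTCCT GGGGTTGCTG CTCAACTGGT TGTTTGCCTG A"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MFSRTVPPMSRASFERIVPV
    print(translate(LAST_LINE))   # ATYVALLGLLLNWLFA
```

The first display line translates to the first 20 residues of the protein sequence, and the last line, which begins in frame at position 1081, translates to the final 16 residues before the TGA stop codon. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.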