Gene Rcas_1668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1668 
Symbol 
ID5539144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2147527 
End bp2148978 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content62% 
IMG OID640893805 
Producthypothetical protein 
Protein accessionYP_001431778 
Protein GI156741649 
COG category[S] Function unknown 
COG ID[COG1300] Uncharacterized membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00126886 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0026953 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCGCTG TAAGCGCGAT ACGCATCATT GTTCGCCGTG AAGTCAACGA CACACTGACC 
GATTGGCGTA TTCTTGTGCC AATCTTTATT CTCACATTTG TGCTGCCACA ACTGTTGATC
GCTGCATCGA GCGTGGCAAT CGACTTTGTC GGCGATCGCG GGTCGGTCGT TCGTCTCATT
CCTTTCGCCA TGTTGCTGGT CGGCTTCATT CCCGCATCAT TTTCGCTGAT CACCGCGCTC
GAGTCGTTCG TCGGCGAACG GGAGCGCAAC AGCCTGGAAG CGCTGCTGTC GATGCCCCTC
TCTGACCATG CGCTCTACCT GGGGAAACTC CTTTCTGCGC TCATTCCGCC GCTCATGTCG
TCGCTCCTGG CGATGACCAT CTTTGGCATC TCGCTGCGCG TGCGTGAACC CGACCTCTTC
TTCGATGGAC TGACGTTCGA GTATCTGGTG GTGGTGCTGC TCCTGATCCT GGTGAAGGCG
GTGGTCATGG TCGCCGGAGC CGTGATCATT TCGAGCCATA CGACGAGTAT TCGCGCCGCC
AATCTGCTGG CAAGCTTCGT GTTGTTGCCG ACGGCGGCAA CCATTCAGCT CGAAGCGCTC
CTGATCATTG CGCGCCGCTG GGACGTGCTC TGGCTGGCGG TCGCGCTCCT GCTCGTGATC
GCCGCAGTGC TGACACGCAC CGGTATGGGG GCGTTCAACC GTGAAGAGAT TCTATCGCGC
GAGCACGAGC AGCTGAACCT GCGCCATATC GCGCAAACAT TCCTGACGTT TGCGCGTGAG
TATCAGCCGG CAGGCACGCC GCCGGAGGCC TACACCGGCG CGCCGTTCTC GCTGCGTCGG
TTCTACCGGC ACGACCTGCC TGCGCTCCTG CGCGACTATC GCACGCCATT GCTCGTGGCG
CTGCTGGCAG CAATCGCTGG CGCGCTGTGC GGTCCGCTCC TCGGAGGCTT CTTCGACCGG
ATCGGGCAGT CGCCGGGGCG CGTCGGCATC ACGCCGGAGC CGAGCCTGGC GCTTGGCATC
TTCACCGGCA TCGCCAACAG CGCGCGCCTG CTGATCACGG CGCTGCTGGC AACCTTCACG
TTCGGCATCT TTTCGCTCAT GGTGCCGTTC TTCGCCTTCG GCGGCATCGG GTACATCGCC
GGGGCGCTGA TGGCAGGCGG CGGCGACTGG CTGACGCTGG GACCCGATAG CCCGCTTCAG
TTTGTGATTG GCTATGTGCT GCCGCATGGC ATCATCGAAC TGCCCGCTGC CCTGCTCGGC
GCGGCGCTCG GCATCCGCAT CGGCGCCGCC GTGATGGCTC CGCCAAAAGG GTTCACCGTC
GGGCAGAATA TCCTCTGGTC GCTAGCGCAG TTCGGTAAAG TGTGGCTCTT CGTCATCCTG
CCGATGTTCC TGCTGGCAGG GATTGTCCAG CAACTGATCA CGACGCGCAT TCTGGCGGCG
CTGTACGGAT AA
 
Protein sequence
MSAVSAIRII VRREVNDTLT DWRILVPIFI LTFVLPQLLI AASSVAIDFV GDRGSVVRLI 
PFAMLLVGFI PASFSLITAL ESFVGERERN SLEALLSMPL SDHALYLGKL LSALIPPLMS
SLLAMTIFGI SLRVREPDLF FDGLTFEYLV VVLLLILVKA VVMVAGAVII SSHTTSIRAA
NLLASFVLLP TAATIQLEAL LIIARRWDVL WLAVALLLVI AAVLTRTGMG AFNREEILSR
EHEQLNLRHI AQTFLTFARE YQPAGTPPEA YTGAPFSLRR FYRHDLPALL RDYRTPLLVA
LLAAIAGALC GPLLGGFFDR IGQSPGRVGI TPEPSLALGI FTGIANSARL LITALLATFT
FGIFSLMVPF FAFGGIGYIA GALMAGGGDW LTLGPDSPLQ FVIGYVLPHG IIELPAALLG
AALGIRIGAA VMAPPKGFTV GQNILWSLAQ FGKVWLFVIL PMFLLAGIVQ QLITTRILAA
LYG