Gene Rcas_0050 details


Gene Information

Locus tag: Rcas_0050
Symbol:
ID: 5537508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 64741
End bp: 66081
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 62%
IMG OID: 640892215
Product: hypothetical protein
Protein accession: YP_001430206
Protein GI: 156740077
COG category:
COG ID:
TIGRFAM ID:
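
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def feature_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Amino acid count for a CDS, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length(64741, 66081)
print(gene_len)                           # 1341
print(expected_protein_length(gene_len))  # 446
```

Both results agree with the Gene Length and Protein Length fields of this record.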


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0359798
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTATCAA CTCAACGCTT GTTGATGGTT GCTGCGCTTG TCGCTATCTT CGCATCGTTT 
CCGCTTGTCG ATCAAGCCCA CGCTGAAGAG ACGTCTCCGT CTCCTTCTTC AGACTCTCCC
TTTGCAACCT CACCGATCGC TCCGACGTAT CGGGTGTTCG CCACGCGCCA GGGGCGCGTC
GGTCGCCGCA CTGCCAATGG GCACATCATT CAACCACGCG ACCGGTTCGT GGCGCTTCCG
TCGTGGAGCG CGCTTTCGAG CCGTGGTGGA TCGGAGTATC AGGTGCGCGT CACCTATCGC
AACCGCAGCG TCGTGTTGCC GGTGTGGGAC GTCGGTCCGT GGAACACGCG CGATGATTAC
TGGTCACCGA ATCGACAGTA TGGCGACCTT CCTGTCGGAC TGCCAATGGC GCAAGCCGCG
CGCCAGCAGG GGTACAACAA CGGACGCGAT GAGTTTGGGC GCCGTATCCG TCAACCGAAC
GGCATCGATA TTGCCGATGG CGCGTTCTGG GATGATCTTG GTATGGTCGA TAGCGATTGG
GTCGAGGTGA CGTTTTTGTG GCTCGGCGCC GACCCGTTCG TTGCATCAGA TGATGCCTCC
TCAACGACTG ATCGCGCGGC GGTCGAACCG GAGGCGATTG TCGTGGATGA TGGCACTTCA
GAATACGCTG CAACCCATGG AAGAAACTGG CAACACGCCG ACTGCGGATT CGGCGGCGGA
CACGCCTGGA GTTACGACAC GCCGCAGGCA ACGGTGCGCT CTCAGCACCG CGCCGTCTGG
TCGCCCGATC TCCCAGGAGA AGGCTTCTAC GAGGTCATGG CATTCATTCC AACATGCGGT
CCAACACCGA CGAGCCGGGC GCAGTATTCG GTGGTGCATA GCGGCGCGGT GTCTGATGTG
GTTATTGATC AGGGCGCAGC ATCCGGTGGA TGGGTCTCGC TCGGGGTCTT CCACATGGGA
CCCGGTAGTT CGGTGACGCT GACCAATCAG ACCGGCGCCG ATGGTCGCGC GGTGCATTTT
GATGCGCTTA AGTGGGTTCC GCGCAACGAC CAGGCGCCAC CGGACGCTTC GGTGATTGAG
GCGACGCTCC TGCCCGAAGG CGGCATTCTG GTGCGCTGGG ATGGACAGGA CGACGTCAGC
GGCATTGCGT CGTTCGATGT GCAGGTGCGT CGCGCCCCCG ACGGCGAATG GATCGATTGG
CGTAGTCGGG CGACCGATCG GGAAGCGCTC TTCGTTCCCT CTGAGCCGGG CGCCTATGCC
TTCCGCGCTC GCGCCCGCGA TTGGACCGGC AAAGAACAGC CCTGGCCCGA TCTGGATGAT
GTTCAGATCG TTGTGCCATA G
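
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a rough sketch, the GC fraction of such a block-formatted sequence could be computed as below; the helper names are hypothetical, and the example input is only the first line of the sequence, not the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGTTATCAA CTCAACGCTT GTTGATGGTT GCTGCGCTTG TCGCTATCTT CGCATCGTTT"
print(len(clean_sequence(first_line)))
print(round(100 * gc_fraction(first_line)))
```

Passing the full 1341 bp sequence to `gc_fraction` gives the value to compare against the GC content field.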
 
Protein sequence
MLSTQRLLMV AALVAIFASF PLVDQAHAEE TSPSPSSDSP FATSPIAPTY RVFATRQGRV 
GRRTANGHII QPRDRFVALP SWSALSSRGG SEYQVRVTYR NRSVVLPVWD VGPWNTRDDY
WSPNRQYGDL PVGLPMAQAA RQQGYNNGRD EFGRRIRQPN GIDIADGAFW DDLGMVDSDW
VEVTFLWLGA DPFVASDDAS STTDRAAVEP EAIVVDDGTS EYAATHGRNW QHADCGFGGG
HAWSYDTPQA TVRSQHRAVW SPDLPGEGFY EVMAFIPTCG PTPTSRAQYS VVHSGAVSDV
VIDQGAASGG WVSLGVFHMG PGSSVTLTNQ TGADGRAVHF DALKWVPRND QAPPDASVIE
ATLLPEGGIL VRWDGQDDVS GIASFDVQVR RAPDGEWIDW RSRATDREAL FVPSEPGAYA
FRARARDWTG KEQPWPDLDD VQIVVP
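
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds a codon table by hand; it relies on the fact that NCBI table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. The function names are illustrative, and only the first line of the gene sequence is used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text):
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGTTATCAA CTCAACGCTT GTTGATGGTT GCTGCGCTTG TCGCTATCTT CGCATCGTTT"
print(translate(first_line))  # MLSTQRLLMVAALVAIFASF
```

The output matches the first 20 residues of the protein sequence listed in this record.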