Gene Rcas_3061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3061 
Symbol 
ID5540557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3963791 
End bp3964828 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content60% 
IMG OID640895180 
Producthypothetical protein 
Protein accessionYP_001433133 
Protein GI156743004 
COG category[S] Function unknown 
COG ID[COG0392] Predicted integral membrane protein 
TIGRFAM ID[TIGR00374] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.41699 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.49585 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAGGT GGCATATCAG TACCAAGACA GCGCAGCTGC GACGCACTCC CTGGGGCGCC 
ATTATCGGGC TGCTTCTGCT GGCAGCTATC GTTGCGGTCG CCTATCTCAA CCGCGAGGAG
GTGATCAAAG CCTTTGTTCT GCTGCGCAAT GTCCGTCCGG GGTATCTGCT GCTCGCGTTT
GTCGGTGTCG TGATGGGCTT TGTGTGCGCC GGGCAGATCT ATGGCCGGGT GCTGGCGATT
CTGGGGCATC GCGCGCAGTT CTGGTGGTTG ACCGCCGCAG CAATGGTGAC GATTCTGATC
AACCAGGCGA TTCCTGCCGG AAGTGTCGGC GCCTATGCCT TCCTGGTCGC CAGTCTGCGG
CGACGCGGTT TCCCCGTCGG TAGCGTCGCT ATGGTCGCGG GGATGGAACT GCTCAGTTGG
AATGGCGCCG TTCTGGTGGC GTTTACGTAT GGATTGATGT ATCTGCTGGT GACGACCGGA
TTGAGCGGCG CGTCGGTCAG TTATGGCGCA CTGGCAGTTG CACTAGGGGT GATGAGTGGT
GCGATCTATG TTGCCTCACG CCCAGACACT ACGCTTCAGG AGTGGGCGCT CCGCCTGAAG
CGCCTGGTGA ATCGCCTGTT TGGACCGGTA TTGACAGCAT CGCAGGTGAC ACAGGCGGTC
GATGAAATTA TTGCCAGTCG ACGCCTGATC CTCGAACAGC CACGCCGGAT TGTGGTGCTG
GTTGGGTTGC AGTTGTTGAT CTTTTGTTGC CATAGTCTGG CATTGCTGGC AATTCTCCAT
AGCCTGGGCG CCGATCCGCC GCTTGCTGCG ATGTTCGCCG CATATGGTCT GGCGCTGATC
GTGAGCGTCT TCACTCTGCT CCCCGGCGGC GGCGGAACGG TCGAAGCTGC GCTGACCGTC
GCTCTGCACG CCCAGGGTGT GCCGCTCGAA GCGGCATTAG GGGCTGCGAT CCTGTTTCGC
CTGATCAGCT TCTGGATGAT GCTCCCCATT GGTATGCTCT GCTATCGTCT GCTTACCCGA
TCTTCCGGCA AGTCCTGA
 
Protein sequence
MRRWHISTKT AQLRRTPWGA IIGLLLLAAI VAVAYLNREE VIKAFVLLRN VRPGYLLLAF 
VGVVMGFVCA GQIYGRVLAI LGHRAQFWWL TAAAMVTILI NQAIPAGSVG AYAFLVASLR
RRGFPVGSVA MVAGMELLSW NGAVLVAFTY GLMYLLVTTG LSGASVSYGA LAVALGVMSG
AIYVASRPDT TLQEWALRLK RLVNRLFGPV LTASQVTQAV DEIIASRRLI LEQPRRIVVL
VGLQLLIFCC HSLALLAILH SLGADPPLAA MFAAYGLALI VSVFTLLPGG GGTVEAALTV
ALHAQGVPLE AALGAAILFR LISFWMMLPI GMLCYRLLTR SSGKS