Gene Rcas_2844 details


Gene Information

Locus tag: Rcas_2844
Symbol:
ID: 5540333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3686985
End bp: 3688271
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 64%
IMG OID: 640894973
Product: CBS domain-containing protein
Protein accession: YP_001432933
Protein GI: 156742804
COG category: [S] Function unknown; [T] Signal transduction mechanisms
COG ID: [COG1993] Uncharacterized conserved protein; [COG3448] CBS-domain-containing membrane protein
TIGRFAM ID:
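
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(3686985, 3688271)
print(length)                  # 1287
print(protein_length(length))  # 428
```

Both values agree with the Gene Length and Protein Length fields of this record.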


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATCTGA GTGGTCATGC ACAGCAGGTC TGGATTTTTC TGGGCGAAAG CGATCAGTGG 
CACGGACGTC CGCTCTCGTT GGCGCTCCTC GAAATGCTCA AACGCAACGG CATCGCCGGC
GGCACCGTCC TGCGCGGTAT TGCCGGGTAT GGCGCGCACA GTTTCATTCA CACAACGTCG
CTCGTCGAAC TGAGCAGCGA CCTGCCCGTT ATCGTAACGT TCGTGGATCG CCCTGATCGT
GTGGCGCGGG TCATGCCCGA AATCATGAGC ATGGTGCGCG AAGGGCTGAT CACCACGATA
CCGGTCGAAG TGCTCAAATA CACCTCGCGT GCCGTTGGTC CCTTCCCAGC GCACCTGACC
GTCGCCGACA TTATGAGCCG CCAGGTAGTC AGTGTGCGTC CCGATACGCC GATTGCCGTG
ATCGTCGAGT TGTTGATCGA CCGCGCTCTG CGATCGGCGC CGGTTGTCGA TGCAGAGAAT
CGCGTCGTAG GCATCATCAC CGACGGCGAC CTCCTCACCC GTGGCGCCAC CGAACTGCCG
CTGGCATTGC AGCGCGAACT GTCGTTGGCG GAGCGCGCCG CCGCAGTCGA AATACTGGCG
GAACGTCCAC ATACCGCCGC CGACCTGATG ACTCCTGATC CGGTGACGCT ACCGATGACA
ACGCCCCTGG CGGAAGCAGC GGCAATTATG GCGGATCGTG GCTTGAAGCG CATCCCGGTA
GTCGATGAGC AGCATCGACT GGTCGGCATG GTCAGTCGCT ATGATCTCCT GTCTACGGTC
GCTGAAGGGC TGCGCCAGCG TCCTGCAGAA CCGGTCGTGC CATCCGGCGG CGCGCCACAG
ACCGTTGGCG ACATCATGAT GACCGGCATT CCGACGGTGC GCCCGGATAC GCCGCTGGCG
GAAACCCTCG ACCACCTGCT CGAAACCGAC AAACGCCGCG TTGTCGTCGT CGATGAACAT
CACCACGTCG TCGGAATCAT CAGCGATGGC GACGTATTGC GGCGCGCGGC GAAGCGGGTG
CGTTCCGGCG CACTGCGCGC CCTGGCTGCC TGGTTTGGCG GCGGCGCCCG CCCGCCGGGT
CTCGAAGTTG CAGCCGAAGG ACGCACCGCC GCCGATGTGA TGACCAGTCC GGTGGTGACG
CTGCCAGCCG ACGCGCCAAT CACGGAAGCC GTCCGGTTGA TGATGACGCA TAAGATCAAG
CGCATCCCGG TCGTTGACGC CGACAAACGG TTCGTTGGCA TGGTCGGGCG GGCAGGGGTG
CTGGCGGCGT TGAGCCGGAG AACATGA
 
Protein sequence
MDLSGHAQQV WIFLGESDQW HGRPLSLALL EMLKRNGIAG GTVLRGIAGY GAHSFIHTTS 
LVELSSDLPV IVTFVDRPDR VARVMPEIMS MVREGLITTI PVEVLKYTSR AVGPFPAHLT
VADIMSRQVV SVRPDTPIAV IVELLIDRAL RSAPVVDAEN RVVGIITDGD LLTRGATELP
LALQRELSLA ERAAAVEILA ERPHTAADLM TPDPVTLPMT TPLAEAAAIM ADRGLKRIPV
VDEQHRLVGM VSRYDLLSTV AEGLRQRPAE PVVPSGGAPQ TVGDIMMTGI PTVRPDTPLA
ETLDHLLETD KRRVVVVDEH HHVVGIISDG DVLRRAAKRV RSGALRALAA WFGGGARPPG
LEVAAEGRTA ADVMTSPVVT LPADAPITEA VRLMMTHKIK RIPVVDADKR FVGMVGRAGV
LAALSRRT
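
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how one might do that and then compute GC content and check the terminal stop codon. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons and ends in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Illustrative fragment only: the first 20 bases of the gene sequence above.
fragment = clean_sequence("GTGGATCTGA GTGGTCATGC")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.55
```

Passing the full gene sequence to the same functions gives the length and GC fraction to compare against the Gene Length and GC content fields.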