Gene Rcas_3548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3548 
Symbol 
ID5541049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4627362 
End bp4628468 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content49% 
IMG OID640895667 
Producthypothetical protein 
Protein accessionYP_001433615 
Protein GI156743486 
COG category[S] Function unknown 
COG ID[COG4938] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.112036 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAGA CAATTCATCT CCATAACTTC AAATGCTTTG GAGACCAGAG CATTCACTGC 
GCTCCTCTGA CCATTCTCAC AGGCGCTAAC GCTACTGGAA AGTCGAGCGT CATTCAGGCG
CTCCTTCTGT TACGCCAATC ACACCAACGC GATACTCTGC GTGATGGCGT CCTTCTTTTG
AATGGTTCGC TCGCCACTTT AGGGACAGTA ACCGATATTT TCTACCAAAA CGCCCAGGAT
AACGATCTCT CGATCGAAAT TGAAGCGTCT GAAGATATAA GATTCAAGTT TATGTTTGAG
CGAGGCGAGC CGACACAGCG CGTACTTAGA GGTCAATCCC TACAGCACTA CGAAGCGATT
AATCTTTTCT ATCCGCAGTT CAACTATCTC AGCGCCGAAC GTCTTGGTCC TCGAACAATC
TTTCAGATGC CAGAAGACGA GCATGAATCG CACAATGTTG GTATTCATGG CGAGTATGCC
GCATTTGTTG CCGGTCGCAA CCAGCAGGAG TTAATAGCCA ACGAAAACCT GGCATATACT
AATGAAGAGA CTGGTGAAAC ACGTCGCGAG CTCTACCAAC AAGTGCGATA CTGGATGCGG
CAAATCTTTC CTGGCTTCGA ATATGCTATC ACAGTGCTAA CGGATGCTGA TCTGGTGCAA
ACCATGTTCG GCAATTTGCC AGGTCAGAGA TTGGTGCGTC CCACAAACAT TGGGTTTGGT
TTGATCTATA CCCTGCCCGT TGTCGTCGCC GCTCTGGTAG CGCCCACAAA CTCGCTCCTG
ATTATTGAAA ATCCAGAAGC GCATCTCCAT CCTTTCAGCC AGTCGATGCT CGGTCGTTTC
CTGGCATGTA TTGCAGCAAC TGGCGTGCAG GTTATCATTG AAACTCATAG TGACCATATT
CTCAATGGAA TCAGGATTGC TGTGCGCAAA GGTGCATGGG GACGTCGAAT TGCAGCAGGG
CACATATCCA TTCAGTTTTT TATTCCTGGT GATGAGACAC GACCACATCG CGTCGACACG
CCGACCATTT ACGCAAGCGG TGGTATTATG CCATGGCCCA TCGGTTTCTT TGATCAGTTC
GACGCCGATC TCACGGAATT ACTATGA
 
Protein sequence
MIKTIHLHNF KCFGDQSIHC APLTILTGAN ATGKSSVIQA LLLLRQSHQR DTLRDGVLLL 
NGSLATLGTV TDIFYQNAQD NDLSIEIEAS EDIRFKFMFE RGEPTQRVLR GQSLQHYEAI
NLFYPQFNYL SAERLGPRTI FQMPEDEHES HNVGIHGEYA AFVAGRNQQE LIANENLAYT
NEETGETRRE LYQQVRYWMR QIFPGFEYAI TVLTDADLVQ TMFGNLPGQR LVRPTNIGFG
LIYTLPVVVA ALVAPTNSLL IIENPEAHLH PFSQSMLGRF LACIAATGVQ VIIETHSDHI
LNGIRIAVRK GAWGRRIAAG HISIQFFIPG DETRPHRVDT PTIYASGGIM PWPIGFFDQF
DADLTELL