Gene Rcas_2154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2154 
Symbol 
ID5539634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2765769 
End bp2767196 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content63% 
IMG OID640894287 
Producthypothetical protein 
Protein accessionYP_001432256 
Protein GI156742127 
COG category[S] Function unknown 
COG ID[COG2308] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.480414 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00287082 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGCCA ATCTCGACAG CATCATTGCG CACTATCATA GCCTGTTCGA TGCGGCGCTG 
GCACGCGAAA CCTTCACAAT CCTCGACACG GAACAACGCG CGCGCAGCAT GCTCATCGGT
CCGCAACGCG ACCGTCTGAT CTGCCCGGTG CTGCGACCAC GGTTCATCAC CCGCGCGCAG
TACGAGACCC TGACCCGTGC TGCGTCGCTC GTCGGGCGCG CCATTCGCGC CGTGAGTGCA
GCCGCACTGA CCGACCCGAC GTTGCTGGCG CCCTACCGTC TTACTCCGGC AGAACGTGAA
TTGCTTGCCA TCGACCCCGG CTACCATGGC GCAACCGTCT TTGGTCGCCT CGACGGATTT
CTGGCGCCGG ACGGCGCATG GTGCTGGTTC ATCGAAGCGA ATGTCGAGTC ACCCGCCGGG
ATCGGGTACG ATGACGCGCT TGCCGGTATC TTCGACCAGA CCCGGATCAT GGCGGCATTC
CGCGAAACCT TCCAGGCGAC GGCGCTGCCG GTGCGCCAGG AGTTGCAACA GATGCTGCTC
GATGCCTACC GCGCCTGGGG CGGACGGGGG ACTCCGACGG TTGCCATTGT CGATTTCCCC
GGCGTGACGA CGTGGTCCGA GTTCGAGCAC CTCCAGCAGC GCTTCGAAGC GGATGGACTG
CCGACAGTTG TCTGTACGCC GGATGACCTG CAATATCAGG GGGGGCGCCT GTACGTCGAA
ACGCGCCTGG CGGATGGCGG CGCCTCCTGG CGACCGGTCG ATCTGGTCTA TCGTCGCCTG
TTACAGCACG AATTCCTCAG CATGTACGAT CTGCACCACC CGCTGATCCG CGCCTATGCC
GACCGCGTGG TGTGCGTCGT CAACCCGTTT CGCACGAAAC CGGCGCATAC CAAACTGATC
ATGTGGCTGC TCTCCGATGA CGAAGGACCG GCGAGCGGCA TCCTCGACGC CGACTCGGCG
ATGGCGGTTG CGCGTCATAT TCCCTGGACG CGGTTGGTGC AACGGGGGAC GACCCGCTAC
CGGGGAGAAC GAGTTGATCT GCTTGACTTT GCCCGCCGTC ACCGTGAGCG CCTGGCGCTC
AAACCGAACG ATGCGTATGG CGGCGAAGGG GTTATCCTGG GCTGGGAAAC TTCACCGCTG
ACGTGGGAAG GGGCGCTCGA ACGCGCGCTC GACGAGCCAT CCGTGCTTCA AGAGCGTGTA
CCGATGCCGG AAGAACCCTA TCCGATCTGG TCGGACAACG AGGGGATTGT TCTCTCCTCG
TATTATGTCG ATGCCGATCC ATGTCTCTAT GGCGACCGCG CGATGGGATG CCTGACACGT
ATCGCCACCG CTGCAAAACT GAACGTTTCC GCCGGCGGCG GTTCTGCGCC GCCGACGTTT
CTGGTCGAAC CCAAGCTGAC ACAGGAGCCT GTCAATGCCA ATCCGTGA
 
Protein sequence
MTANLDSIIA HYHSLFDAAL ARETFTILDT EQRARSMLIG PQRDRLICPV LRPRFITRAQ 
YETLTRAASL VGRAIRAVSA AALTDPTLLA PYRLTPAERE LLAIDPGYHG ATVFGRLDGF
LAPDGAWCWF IEANVESPAG IGYDDALAGI FDQTRIMAAF RETFQATALP VRQELQQMLL
DAYRAWGGRG TPTVAIVDFP GVTTWSEFEH LQQRFEADGL PTVVCTPDDL QYQGGRLYVE
TRLADGGASW RPVDLVYRRL LQHEFLSMYD LHHPLIRAYA DRVVCVVNPF RTKPAHTKLI
MWLLSDDEGP ASGILDADSA MAVARHIPWT RLVQRGTTRY RGERVDLLDF ARRHRERLAL
KPNDAYGGEG VILGWETSPL TWEGALERAL DEPSVLQERV PMPEEPYPIW SDNEGIVLSS
YYVDADPCLY GDRAMGCLTR IATAAKLNVS AGGGSAPPTF LVEPKLTQEP VNANP