Gene Rcas_4412 details


Gene Information

Locus tag: Rcas_4412
Symbol:
ID: 5541925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5665141
End bp: 5667504
Gene Length: 2364 bp
Protein Length: 787 aa
Translation table: 11
GC content: 61%
IMG OID: 640896510
Product: hypothetical protein
Protein accession: YP_001434446
Protein GI: 156744317
COG category: [S] Function unknown
COG ID: [COG1432] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00288] conserved hypothetical protein TIGR00288
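
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5665141
END_BP = 5667504
GENE_LENGTH_BP = 2364
PROTEIN_LENGTH_AA = 787


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5667504 - 5665141 + 1 gives 2364 bp, and 2364 / 3 - 1 gives 787 aa, matching the listed gene and protein lengths.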


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00112817
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0579767
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTACCG ACAATCCGCG TCTCGACGTC GCTGTGTTCA TCGACTTCGA GAACGTCTAC 
GTCAGTGTCC GCGACAAACT CGACGTCAAT CCAAACTTTG AAATCATCAT GGATCGCGTT
GCGGACCTGG GACGGGTGGT GATCGCCCGC GCATACGCCG ACTGGTATCG CTATCCGAGA
GTAACGAGCG CCCTGTACGC CAATGGCATC GAGCCGATGT ATGTCCCGAC CTATTACTAT
GATCGCGATC TGGGTCGGAC GGGGCGCGCC ATCAAGAACA GCGTCGATAT GAATCTCTGT
ATCGACGCGA TGAAAACACT GTACACCAAC CCGAACATCG GCAAATTCGT GCTCGCCACC
GGCGACCGCG ATTTTATTCC GCTGGTGAAT GCGATCCGCC AGCATGGCAA GGAAGTCATT
ATTATCGGAG TGGGGGGCGC CGCTTCGGGG CACCTTGCCC AAAGCGCCGA TGAGTTCATC
TTCTACGAGC AATTGCTGGG CAAGAAGCCG CAGCCGCTGC AAGCCGATGC GCCGCGTATC
CGCATGCCAG AGCGCGGGCG CGATGTTGAG GAGGTAGTGG AACCTGCGCC ATCCGTGCGG
ATGCGCCAGG AAGCGCAATC GGCGGAGCAC ACGCCGCCGC CTGAACCACC CGCTCCCGAA
ACCGCGCGTG AAGAAACCGA CATTTACGAC ATGCTCGTGC GGGCAGTGCA TCTGGCGCGC
GAACGAGGGT ATGTCTGCTC TTTCGGCTCG CTCAAACTCG TGATGAAAGA GTTGATGGGT
GGGGAGTTCA AGGAGAGTAA GTATCGCGAC TCGACCGGCA AGCCGTTCGC AAAGTTCAAA
GATTTTGCCC TCGAAGCCGA GCGCCGCGGC AAGGTGCAGG TTTTTACGAG CGGCGCCGTC
GTCGAAGTCT TTCTGCCCGG CGAGGACCCG TACAAACTTT CGCAGTTCGC CCAGGACCTC
AAAGAAGAAC CGCCGGTCAG CACGTCCACC ACGCCGGTGC ACGTCGATGC CCATATCGAT
GGTCGCCCGG TCTCGAGCAG CCGCCGCCGT CGCCGCCGTC GCAGCAGCAC AACTCGCACA
ACAATCTCGC CCGATACGAT CACTGCCCCA TCGCAGGATG AGTTCGTCGT GCATGAGGAC
GTGACCGACG AATTGATGAC GGAGGCGCTG CACGACGTGA TGAACGGCGA ATCCGCCGCG
ACGGACGTGG TCTTCGAGGA ATTGCTCGAC CGCCTGGAAG CCGAGCGTGA GCGACAGGAA
GCGCTTGCAT CGTTCGATGT GCCGATTGAC GAGCCAATGC TTGATGAGTT GCCCGATCTG
GAGGAGGAGC CGGATGTGAC GATATCGGGC GACAGTCTGG TGGAAGAACC GATTCTGGCA
GCGCCTGAAC CGGACGATCA GATGATGATT ATGGATAGGC CGGTTGAGCT GTCGGAGCCA
TTTGCGACGC CTGAAACACC GGTTGGTGAA CCAACAGCGC ACCTGCCGGA GACGCCTCCC
TTCACCGACG CCGAGTGGCA GATGTTGCGC GAGGTGGTCG TTTCGGCGGG ACGTCCTTTG
TCGTTCGCAC AGATCCACGA TCTGATGCGG GGCGCCCGCA ACAACGCCGG CATCGTTCGC
ACGAACGAAG AACTGCGGTC GCTGATCAAG CAGGCGATTA ACACCGGTGT CCTTCGCCGC
AGCGGCAAAG GCGCCCGCGT CGTGTATCAC CTGGCGCCCT ACGAAACATC CGCAGCGACA
GAAGGCGTCG ATGCCTCTGC GGTCAAGGCA TCGGAAGAGG CGGCGTCTGA GCCGTCGATT
ATCAGGGACG CTGTCGAGAC GATGGTTGTG TCCGAGGCAT CTGCGCCTGC GCCTGCATTG
GAGTCGCCTG AAATCGTTGC TGCCGAACCA CTCGCTGCCG CTGACGTATT ACCAGCGGAT
GTGACCGACG AGATGCCGCA GGGAAAGGTT CCTGAAGAAA CAGAGACCGC TGCCGAACCA
GAAGTGTTGG CGCCTGCGCC GGACGCAGTG GAAGAAGGCG TCGTCGCTCA TGGTGTGGAG
TCGCCGCCGG TCGTAGCGGA AGGATCGGTG ATGGCGGAGG AGACGCCACG TGAGACGTCG
GTCACCGTCG GCGTCCCGGC AGAGGCGCCG GTGACAGCAG TTGACGATCA CGCAACCGCA
GAGTCAAAGG TTGTCGACCA ACCGACGCCA CGCCGTCGTC GCACTACTGT GCGCACGAAC
GCAGAAGCCG ATCAAACGAC GGCGGAAGCG CGTCCAACAC GTTCACGCAG CAAAGCTGCC
GTTACCGCGC CGACCGAAGA AGCGCCGCGT CCGACACGGC GGCGCAAGAA AGTCGAGACG
ACTGAAGAAC CGACTGAGGT GTGA
 
Protein sequence
MATDNPRLDV AVFIDFENVY VSVRDKLDVN PNFEIIMDRV ADLGRVVIAR AYADWYRYPR 
VTSALYANGI EPMYVPTYYY DRDLGRTGRA IKNSVDMNLC IDAMKTLYTN PNIGKFVLAT
GDRDFIPLVN AIRQHGKEVI IIGVGGAASG HLAQSADEFI FYEQLLGKKP QPLQADAPRI
RMPERGRDVE EVVEPAPSVR MRQEAQSAEH TPPPEPPAPE TAREETDIYD MLVRAVHLAR
ERGYVCSFGS LKLVMKELMG GEFKESKYRD STGKPFAKFK DFALEAERRG KVQVFTSGAV
VEVFLPGEDP YKLSQFAQDL KEEPPVSTST TPVHVDAHID GRPVSSSRRR RRRRSSTTRT
TISPDTITAP SQDEFVVHED VTDELMTEAL HDVMNGESAA TDVVFEELLD RLEAERERQE
ALASFDVPID EPMLDELPDL EEEPDVTISG DSLVEEPILA APEPDDQMMI MDRPVELSEP
FATPETPVGE PTAHLPETPP FTDAEWQMLR EVVVSAGRPL SFAQIHDLMR GARNNAGIVR
TNEELRSLIK QAINTGVLRR SGKGARVVYH LAPYETSAAT EGVDASAVKA SEEAASEPSI
IRDAVETMVV SEASAPAPAL ESPEIVAAEP LAAADVLPAD VTDEMPQGKV PEETETAAEP
EVLAPAPDAV EEGVVAHGVE SPPVVAEGSV MAEETPRETS VTVGVPAEAP VTAVDDHATA
ESKVVDQPTP RRRRTTVRTN AEADQTTAEA RPTRSRSKAA VTAPTEEAPR PTRRRKKVET
TEEPTEV
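
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own procedure: it uses a hand-built codon table (translation table 11 shares its amino-acid assignments with the standard code and differs only in permitted start codons), ignores the spaces and line breaks used in the sequence layout, and stops at the first stop codon. The `translate` function and constant names are hypothetical. Because this gene starts with ATG, no special handling of alternative start codons is needed here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and newlines
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence above.
first_row = "ATGGCTACCG ACAATCCGCG TCTCGACGTC GCTGTGTTCA TCGACTTCGA GAACGTCTAC"
print(translate(first_row))  # MATDNPRLDVAVFIDFENVY

# Last row of the gene sequence above, which ends in the TGA stop codon.
last_row = "ACTGAAGAAC CGACTGAGGT GTGA"
print(translate(last_row))  # TEEPTEV
```

The two outputs match the first 20 residues and the final 7 residues of the protein sequence listed above. Splitting by row works here only because each full row holds 60 bases, a multiple of 3, so the reading frame is preserved at row boundaries.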