Gene Rcas_0032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0032 
Symbol 
ID5537490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp41601 
End bp45938 
Gene Length4338 bp 
Protein Length1445 aa 
Translation table11 
GC content61% 
IMG OID640892198 
ProductTPR repeat-containing protein 
Protein accessionYP_001430189 
Protein GI156740060 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGAA TGAGCCTGCA GACGGCTTTC GACCAGGTCC GGCAATGGCT CGAAACCAAT 
GACCTCGATC GCGCCATTGG CATGGCGCAC CATATCCTCG AGACCTACCC CCACTGCCTG
GAAGCGCATC AGATGCTTGG AGAAGCGCAT CTGGCCAATC GTCAGTATGA AGAAGCGTGT
ATGCATTTCG AGCACGCGCT TCAGTTCGAT CCAGAGCATA TCCCGGCGCT TGTGGGCCTC
GGGATGACCT GCGAACGACT TGGACAGCTT GAGCGCGCCA TTTCGGCCTT CGAGCGCGCC
CTTGAAATCA AGCCCGACTT GCCGGAACTG CGGAGTCAGT TGCTGCGGCT CTACACCGAT
GCCTGGGGGA GTGAATATGC GCACTTGCGT CTCAGTCGCG CCGGACTGGC GCGATTGTAC
GCAAAAGGGC ACATGCTGCC ACAGGCCATC TCTGAGTTTC GTCAGGTTGT GGCGGATCAG
CCCGACCGTC TCGATTCGCG CGTAGCGCTG GCAGAAGTCC TTTGGCGCGA TGAGCAGGAA
GAAGAAGCGG CAGATGTGTG CCGTGACATT CTTGCAAAAC ATCCCCATGT GTTGAAGGCG
AATCTGATTC TTGGGTATAT TGAATTGGCG GCTGGAAACC CGTCTGGCGA ACAGTTCTGG
AAGGCGGCTG CCGCTATCGA TCCGTATCAG GGCATGGCCC GCGTCTTGTT CGAGTCGCTC
CCGCCTGTCA CGATAGAGGA ACCGACGCTT CCAGAGTGGG ATGAGGCGGA ATGGTTCCAT
CGGCAGGTGA CGGCGCCTGC TGAGCAACTC GCCGCAACCC GTTCGATGGA GGCGACGACG
CCAACCGCCG TTGTTGCGCC ACCGTCGCCG CCCGCACCGC TACCGGCTTA TGCTGATAGC
GATGACTTTC TCGCCAGTCT GCTGGCGATC GATGCTGCGC CCCCGATCGT GTCGCATCCG
ACCGAGGAAG CCGAGGTTGG AATCAGTTCG GATACACGAC CATTTACGCT TGAGGAATTG
GGGTTGAGCG AGGCTGAACT TGCCGGTCTC GGTGCGCCGG AAGAAGCACC GTCGGAGTCC
TCGGAATCTG AGCGTTCGAT GCCGGTGAAG CCGGTTGTAT CCGAAGAACC GCCCGACCTC
GCCGCGATGC AGCCCTTCTC CCTCGACGAA TTGGGACTGT CGCCCGACGA GATTGCCGCC
CTCGACAGCG CCTCGGCGAG CGCCGCAGCC GACCAGCCGC CGCCCGACCT CGCCGCGATG
CAGCCCTTCT CCCTCGAAGA ATTGGGCCTG TCGCCCGACG AAATCGCCGC CCTCGACAGC
GCCTCGGCGA GCGCCGCAGC CGACGAGCCG CCGCCCGACC TCGCCGCGAT GCAGCCCTTC
TCCCTCGAAG AATTGGGCCT GTCGCCGGAC GAAATTGCCG CCCTCGACAG CGCATCGGCG
AGCGCCGCAA CCGACGAGCC GCCGCCCGAC CTCGCCGCAA TGCAGCCTTT CTCCCTCGAA
GAATTGGGCC TGTCGCCCGA CGAAATCGCC GCCCTCGACA GCGCCTCGGC GAGCGCCGCA
ACCGACGAGC CGCCGCCCGA CCTCGCCGCG ATGCAGCCCT TCTCCCTCGA AGAATTGGGA
CTGTCGCCCG ACGAGATTGC CAGTCTGAGC GGTGAGACAA TCATCACCAC GCCCGAACCA
TCACAACTCG ACGATTTCGA TTTTGACACC CAACCCTTCT CGCTCGACGA TATCGATTTT
GAAGGGAGTG GACGCATCCC GGCAGCGGGG CGCGATACTG GATCTGGTGC TGGCGATGTG
CCGCCCGACC TGCAACCGTT CTCGATCGAA GAACTGGAAA TCGGCAGCAT GAGCGGTCCG
GCAGACCTTG GCGAGTTGCC GCCATCACTC CAACCCTTCT CGCTCGAAGA ACCACCTGCG
CCGCAACGCC CGCGAATGGC CGGTCTTACG CCGGAAGAAG CAGCCGAAAC CCCAATCGAA
GAAGAAGATA CCTTCCTGCC GCGCGGTTTT AGCTGGCAAC AACCATCCCA GCGCACGGAG
CCATCGTTCT TCCAATCGAC ATTGGCGGCT TCGACGCCAG ACACAGGCAC TATTTTCTCG
AAACTGCAAA AGCAGCGCGA AAGTGCGCCG CTGCCGCGCG CGGAGGAGCC GCCTGCGCCG
CCCATCGCCC CGGATGAACA CCTGGGATTG TTCTCATTGG ACGATGTACC CCTGCGCGAC
GATGAGTCGC TTGAATCTGG CGCCCTCTCC TCCACTGTGG CGCCGCACAT CGAAGCGCAA
TGGCAATCTT TGGTCCCTCC GGCTCATGCG GAACGACCGA CTGATACGGC GCACATCTCG
ACACCGCCCG GCGATAGCGG ACAGACACAG GCGATTTCAC TCGAAGAAAC AGAAAGCATC
GAGTCCGCCA TTGCCAGCGG GGTCATTCAG CCATTCTCAT TCGTCGATCT TGGTCTGACC
GAAGAAGAGA TTGCTGCGCT TGGCTTGAGC ACCACGCCTG ATATGCCGGG AGCGCAGGAG
GAAATACCAG AGACCCCGGC GGAAATCGGC GCAGCGGCGC AGGAAGAACC TCCTGCCGTA
GCAGCGTCGC CGGTCGAGCC GCCCGTCACC CCTCCCGATC GGGAACAGGC GCCGCCACCA
CTCGAGGAAG TGGAAAGCAT CGAAGCCGCG TTGTCGAGTG GTCAGATTCA ACCATTTTCA
TTCACCGACC TGGGTTTGAG CGAAGAAGAA ATCGCCGCCC TGGGTCTTGG CGATCTGGAA
GGATTTGCCC AACCGTCGGT TTCTTTCGAA CCAGAGACTC TCGAGGAGCC GTCGGACATC
GAGATGTTTG AGGCGCCGCC CAAACCGGAA CCATCTGCCG AACTCTCGAC AGGCGCCGGC
GTTTCTCCGG CGCCTGAGAC AACGGCAGAG CCTGTGGTCG AAGAAGCCCT CGGCATCGAA
GAGTTGCAGC CGTTCTCGCT TGACGATCTG GGTCTTTCGG AAGAGGAACT GGCAGAATTC
GATCTGTCGA GCCTCGAGGA TGAGGAAGGA GCGCACGACG AAGGACGCCT GGGCATTACC
GAGGAAGAAC TGGCAGCCCT GGGAACAGGC GGCGATCTTG CGTGGGCTCC GGAGCCTGCA
CCTGTGAGTG AGCCGTCTGC CACTGCATTC GCGCCAATTG AATCTGCGCC TGAAGTCAGC
AGCGGGGATG AGGTTGTCGA TCGGTTGATT CAGATCGGTC ATGAGCGCGG TTATGTCGAT
ATCGCCGACA TCATTGCTGC GGTGAAGGAC CCCGAAGCGG AAGCGGCGCG GATCGACGAG
ATCGGGCGAA TGCTCCACGC CGCGCGGATC GAGATACGCG ATGGCGACGA GGTGATCGAC
CTCGATGCAG AGTATGCGGA TGAGGAAGCG CCATTGATGC CAGAAGGCGC CACGCCTGCC
GCGAATGCGG CAGAGGAAGA AGATCTGATG CGCCCCTTCT CACTCGAGGA ACTCGGGTTG
TCGGATGACG AAATCGCCAT GCTTGCCGCC GCTGCCGCCA GCCGTGGGGA GGAAACGCCG
TCAACGCCAG CGGAAGAAAC GCCACTCCCC CCCTTCTCAT TGGAAGAAGC CGGTGCAGCA
GACGATGAAA TTGCCATAAT CGCTGCTGCC AGTAGCGGAG AAGAGGCGCC ACCCGCTGCT
GCCGAGGAGC CGTCGCTCAC CCCCTTCTCG CTGGAAGAAC TGGGATTGTC GGAAGACGAG
ATCGCTCTGT TAAACGAGAC GGCAGCATCT CTCGAAGCGC CACCTCCACC CGCCGCTTCA
ATCGAAGCGG AAACCACTGC ATGGTTCGAT CTCGAACCGG TCGTTGCCGA TGCAGAAGCG
GCAGCGTCAC CCGAATCGTC GGCGCCGGAT ATGGAAGAAG CGCAACCGCC AGCGCCACCC
AAACGTATCG AGCCGGCGCC AGCACCGGCG CCGCCCAAAC ATATTGAGCA ACCGCCGGCG
ATTGCTGCGG CAAAACCGCC TGCTCCGCCG CCAGCGCCGT CGTCTGCGGA AATCTCCAGT
CTGAGCGATT ATCCAGAACT GCAAGAGTAC CTGAACATGC TCGAAACCAA CCCGGACAAT
CACCTCTTGC GGCTCTCGAT TGCGCGCTTT GGCGGTCAGG CGGGAATGTC GGAAATCGCT
ATGCAGCACT ATCGCCGCCT CATCAAGCAG AATGTGCTCC TCGATGAAAT CGTCGACGAT
CTCACCGATA TGATTGCAGA GACCAGCGAT GCGAATCTGC TGCGTAAACT GCATCGCACC
CTTGGCGACG CTTACTCGCG TCAGGGGCGC TTCCGCGACG CGATGCGCGA GTACAGCTGG
ATACCTGGAC AGGCATAA
 
Protein sequence
MARMSLQTAF DQVRQWLETN DLDRAIGMAH HILETYPHCL EAHQMLGEAH LANRQYEEAC 
MHFEHALQFD PEHIPALVGL GMTCERLGQL ERAISAFERA LEIKPDLPEL RSQLLRLYTD
AWGSEYAHLR LSRAGLARLY AKGHMLPQAI SEFRQVVADQ PDRLDSRVAL AEVLWRDEQE
EEAADVCRDI LAKHPHVLKA NLILGYIELA AGNPSGEQFW KAAAAIDPYQ GMARVLFESL
PPVTIEEPTL PEWDEAEWFH RQVTAPAEQL AATRSMEATT PTAVVAPPSP PAPLPAYADS
DDFLASLLAI DAAPPIVSHP TEEAEVGISS DTRPFTLEEL GLSEAELAGL GAPEEAPSES
SESERSMPVK PVVSEEPPDL AAMQPFSLDE LGLSPDEIAA LDSASASAAA DQPPPDLAAM
QPFSLEELGL SPDEIAALDS ASASAAADEP PPDLAAMQPF SLEELGLSPD EIAALDSASA
SAATDEPPPD LAAMQPFSLE ELGLSPDEIA ALDSASASAA TDEPPPDLAA MQPFSLEELG
LSPDEIASLS GETIITTPEP SQLDDFDFDT QPFSLDDIDF EGSGRIPAAG RDTGSGAGDV
PPDLQPFSIE ELEIGSMSGP ADLGELPPSL QPFSLEEPPA PQRPRMAGLT PEEAAETPIE
EEDTFLPRGF SWQQPSQRTE PSFFQSTLAA STPDTGTIFS KLQKQRESAP LPRAEEPPAP
PIAPDEHLGL FSLDDVPLRD DESLESGALS STVAPHIEAQ WQSLVPPAHA ERPTDTAHIS
TPPGDSGQTQ AISLEETESI ESAIASGVIQ PFSFVDLGLT EEEIAALGLS TTPDMPGAQE
EIPETPAEIG AAAQEEPPAV AASPVEPPVT PPDREQAPPP LEEVESIEAA LSSGQIQPFS
FTDLGLSEEE IAALGLGDLE GFAQPSVSFE PETLEEPSDI EMFEAPPKPE PSAELSTGAG
VSPAPETTAE PVVEEALGIE ELQPFSLDDL GLSEEELAEF DLSSLEDEEG AHDEGRLGIT
EEELAALGTG GDLAWAPEPA PVSEPSATAF APIESAPEVS SGDEVVDRLI QIGHERGYVD
IADIIAAVKD PEAEAARIDE IGRMLHAARI EIRDGDEVID LDAEYADEEA PLMPEGATPA
ANAAEEEDLM RPFSLEELGL SDDEIAMLAA AAASRGEETP STPAEETPLP PFSLEEAGAA
DDEIAIIAAA SSGEEAPPAA AEEPSLTPFS LEELGLSEDE IALLNETAAS LEAPPPPAAS
IEAETTAWFD LEPVVADAEA AASPESSAPD MEEAQPPAPP KRIEPAPAPA PPKHIEQPPA
IAAAKPPAPP PAPSSAEISS LSDYPELQEY LNMLETNPDN HLLRLSIARF GGQAGMSEIA
MQHYRRLIKQ NVLLDEIVDD LTDMIAETSD ANLLRKLHRT LGDAYSRQGR FRDAMREYSW
IPGQA