Gene Rcas_3209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3209 
Symbol 
ID5540707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4172585 
End bp4176016 
Gene Length3432 bp 
Protein Length1143 aa 
Translation table11 
GC content67% 
IMG OID640895330 
Producthypothetical protein 
Protein accessionYP_001433281 
Protein GI156743152 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0782308 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.555781 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCGGT ATCTGGAAGC GATAGTGAGC AGTGAAGCGG GCGCGCGAAT GTCACCCGCG 
CTGCGGCTGG CGATCCACGC GATTGCGGCA GCCCTGGCGG GCATGTTGCT GGTAACCCTG
CTGTGCCAGA TACCGGTTGC GCACCGTGTG GACGTGGGAC GGTTCGACGC CGGGTATGTG
CGCGGTTTCT ACGACCCGGA GCGCCTCGAT TTGCCGCAGG CGCGCGCCTA TCTGAACGAG
TCGGATGGAA GTGCGCGCTG GACGCGCGCC GAGTCGTTTC TGCTGTTTCC GCAGGCTGGT
CTTCCCGCTG AGGTGACGCT GCGGCTGCGT GGTTGGCGTG CTGATGGTTC GCCGCCGAAT
GTTGCGATCC TGATCGATGG GCGCGAGGCG TATACCGGCG TCACCACCGG CGAGTGGCAG
GAGATCCGGC TGGCGGTGCA GCAATCCTCG TTGAAACCAG AAGATATGCT GATCACGCTG
CGCGTGGACA CGGCGCCGAT CAGCGCCGGC GACCCGCGAC CGGCAGGGGT GCTCGTCGAT
GCAGCGGAGT ATCGCACTGC GCGACCGCCG TTGCAACCGT ATCCGGCACA ACTGGCATGG
GGGGCGCTTG CGGGGGCGCT CCTGGCGCTG GCGCTGGCAG ATACGCAGTG GGGCAGACGC
GCGCCCTGGC TACGCTGCTT GACGCCGCTG CGCGCGTGGG TTGCCGGAAT GCTGTTGATC
GGTCTTGCGT ACCTGCTGTT GTACCGGCTT CAGCCCGCGT ACCCCTACCC CTTGCGCGGC
TTGCTTCCCG GTGTATGTGC TCTGCTTGGC GCAACGATGG CGGTGCGGTA CGGTCCGGCG
CTTGCGGCAC GCCGGCCCAC ACTGCCGGAT GTTCTGGCAG CAGGCGGCAT CGTGGTATGG
ACGACGGCGA TTCTGCTGGC GGCGCAGGAT CACGTGACGT TGTCGGTTCC AGGAGTCGAA
AAAGATTTCC GTGTCTTCGC TACGCGCGCC ATCGATCTGG CGCTCATCTT CCGCGCCGAT
GGGTTCTACC ATCTGGGATA TCCGCTCATG CTCTGGTGCG TTGCGCCGTT GACCGAGGGG
AATGTCTTTC TGGCGGCGCG CCTGATTGCC GCCTGTGCCG GCGCTGTCTT CCTTGGCGCT
TCCTGGGTCC TGGCGCGCTG CACGCTGGGG CGCGGACCAG CGCTGGCGGT GCTGACGATG
CTGGCACTGA ACCCATTTGT GGTGCAGTAC GCGCTGTACA TCGGCAGCGA TATGCCGTTC
GCGGCGTTCT GCGCGCTGAC CCTGGCGCTG CTGGCACGGT GGACGGAGCG GAAAACCGCC
TGGCTGCTCA TCCTGGCTGG CGTCGCGGCG GGCTGCGCGT TCCTGGTGCG GCACCCGGGA
ATCCTGTTGT TCCCATTCGG AGTGCTGGTT GTGTGGATGA GGCGCGAGGC GCGGGGCGCG
AGGCGCGAGG GGCGTGAGGC GCGCGAGGCG CGAGGCACCC GACAATGGCG TAGCGCGAGA
TTTATCTCGC ATTTGTGGCG AGGCGCGAGG CGCGAGGCGC GCGAGGCGCA AGGCGCCCGA
CAATGGCGTA GCGCGAGATT TATCTCGCAT TTGTGGCGAG GCGCGGGGTG TGAGGCGCGC
GAGGCGCGAG GCGCGGGGCA AGAGGCGCGC GAGGTGCGAG GCGCCCGACA ATGGCGTAGC
GCGAGATTTA TCTCGCATTT GTGGCGAGGC GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC
GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC
GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC
GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC
GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC
GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC
GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC
GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC GCGGGGCAAG AGGCGCGCGA GGCGCGAGGC
GCCCGACAAT GGCGTAGCGC GAGATTTATC TCGCATTTGT GGCGAGGCGG GGGGCCTGAG
GGGCGTGAGG CAAGGGAGAC GCAGCAAGAA GCAAACGTTC AACGTTCAAC GTTCAACGCT
CAACGTTCAA CATTCAACGT TTCCGCCTTC GCTCTTGCCT TTTGTGTGGC AATTGCGCCA
CAGGTGATCG TCAATGTGCG CGATACGGGC AATCCATTCT ACAATCAGCA GGCAAAGAAC
ATCTGGCTTG CCGTGTACGG CGATAGCGAT TGGGGACGCT GGAACGAGGT CAGCAACGAC
GTGTCGCTCG CCGATGTCAT ACTTGAAGAC CCGGCGCGTG TTGCGAGTGC GTGGTGGTCG
AACCTGCGCG CATTCATCGG CGCCGGCGCG GAGTCGGCCG GCGATGCAGG GCAGGCGTCG
CAACTGCGGT TACTGGCATT CCCGGCGAAC TGGCTGGCAG TGGTCGGTCT CGTGGGTTGG
CTGGCGCTGA TCGGGCTGCG GCGCAGGAGT CCAGGCTTTC GAGCCGACGG CGGAAGCGCA
CAGGAAGTGC CAGACGATCA CATTCGTGCA GCAACGCACC CGGACGGACG CCTCATCTCG
CGCCGGAAAG GTGTCCGCCC AGTCAGACGC CCCAGCCTGC GCAGATGGAC GTCCGTCCGC
AGGTTCACGA AGGGCGATGA CAATCGTGGG CAACGACGGA GTTGCCGCCG CAGGCGACGG
TTGCATGGAG CGCATTTGCT GTCTACTCAC AATCACACTC CTGCCACCCC GCCATTTGCG
GCGCTGTTGC TCGTGTGGGT TGCGCTATAC ACTGCCGTGC TCGCCGTCGG CCTGCCATTA
CAACGCTTCT ATCTGCCGCT GGCGCCGATC TATGCTCTGG CAGCAGCATG GACGCTGGCG
CTGGCGATTG GCGCACTGGC GACGCGATGG GCATGGCATC CGGCGCGGCT CTGGATCATT
GGCGTTCTGG TGTTCCTTCT GCTCCTCTGG CAGGGATTCG CGCTCGGCGC GCGCGAAGTG
CTCGACCGCC AGCCCGCCGA CGAGGTTGTG GCGATACGTC TCGTACAGCA GACCGTGCCT
TCTGGCGAGC CGCTGCGCGC CATACTTGCG CCTGGCGATC CGGTCGGAAA GTACTCCGCG
ATTGCGCATC GCATTGTGCC GCCGGATCAG GAGACGCGCT ACCTGTTGCA CAGTAGCGAG
TCGGGACCGC GTCCTGATGG CACGCTGATT GCAGTATTCG GAAGATACGC ACTGGTGCAG
GTTCAGCAGT GA
 
Protein sequence
MHRYLEAIVS SEAGARMSPA LRLAIHAIAA ALAGMLLVTL LCQIPVAHRV DVGRFDAGYV 
RGFYDPERLD LPQARAYLNE SDGSARWTRA ESFLLFPQAG LPAEVTLRLR GWRADGSPPN
VAILIDGREA YTGVTTGEWQ EIRLAVQQSS LKPEDMLITL RVDTAPISAG DPRPAGVLVD
AAEYRTARPP LQPYPAQLAW GALAGALLAL ALADTQWGRR APWLRCLTPL RAWVAGMLLI
GLAYLLLYRL QPAYPYPLRG LLPGVCALLG ATMAVRYGPA LAARRPTLPD VLAAGGIVVW
TTAILLAAQD HVTLSVPGVE KDFRVFATRA IDLALIFRAD GFYHLGYPLM LWCVAPLTEG
NVFLAARLIA ACAGAVFLGA SWVLARCTLG RGPALAVLTM LALNPFVVQY ALYIGSDMPF
AAFCALTLAL LARWTERKTA WLLILAGVAA GCAFLVRHPG ILLFPFGVLV VWMRREARGA
RREGREAREA RGTRQWRSAR FISHLWRGAR REAREAQGAR QWRSARFISH LWRGAGCEAR
EARGAGQEAR EVRGARQWRS ARFISHLWRG AGQEAREARG AGQEAREARG AGQEAREARG
AGQEAREARG AGQEAREARG AGQEAREARG AGQEAREARG AGQEAREARG AGQEAREARG
AGQEAREARG AGQEAREARG AGQEAREARG AGQEAREARG AGQEAREARG AGQEAREARG
ARQWRSARFI SHLWRGGGPE GREARETQQE ANVQRSTFNA QRSTFNVSAF ALAFCVAIAP
QVIVNVRDTG NPFYNQQAKN IWLAVYGDSD WGRWNEVSND VSLADVILED PARVASAWWS
NLRAFIGAGA ESAGDAGQAS QLRLLAFPAN WLAVVGLVGW LALIGLRRRS PGFRADGGSA
QEVPDDHIRA ATHPDGRLIS RRKGVRPVRR PSLRRWTSVR RFTKGDDNRG QRRSCRRRRR
LHGAHLLSTH NHTPATPPFA ALLLVWVALY TAVLAVGLPL QRFYLPLAPI YALAAAWTLA
LAIGALATRW AWHPARLWII GVLVFLLLLW QGFALGAREV LDRQPADEVV AIRLVQQTVP
SGEPLRAILA PGDPVGKYSA IAHRIVPPDQ ETRYLLHSSE SGPRPDGTLI AVFGRYALVQ
VQQ