Gene Rcas_0484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0484 
Symbol 
ID5537947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp624289 
End bp626235 
Gene Length1947 bp 
Protein Length648 aa 
Translation table11 
GC content60% 
IMG OID640892646 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_001430632 
Protein GI156740503 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.339951 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGTT CGATGCCTGC GTTTCATTTC TATGGGATAG ATGCACTGGC GCTGATTATC 
GCCGCGTTTC TCGGATATGC TTTGCGCCTC GAACGTCTCG ATCTGAGGGA ATACTGGGGG
GCGTATGCGC TGTTTACAGG CATAACCCTG ATCGTTGTCC CTTCGACATT TGCGATGGTT
GGCGTGTACG CACAGTACTG GCGCTATGCG TCGTTCTACG AGTTTAGCCT GCTGGCAGTT
GCGCTGGCGT TCGCCGGAAT CATGCTGACA GGTCTTGTTC TGCTCATGCG CGCGCTGTTC
CCGCACTTGC CGGTTGTGCC ATTGTCGGTT CCGCTGATTT TCACCATGCC CGCGATGGCG
CTGACGGCGC TGCCCCGCCT GAGCGTGCAG GCGCGCTTTC GCCCATCGTC GTTGCGTCAT
TCACAGCACA GCAGTAATCG GGTGCTGATC ATGGGCGCGG GCGAAGCGGG CGCAATGATT
GCGCACAGCC TACGCACTGC GCGACGAAAC ACCATCATCG TTGGCTTCGT CGACGACAAC
CCAGGCAAGC GCGGCGTGCG GATCAACGGC GCGTCGGTCC TGGGCAATCG GCACGATATT
CCGCGCCTGG TCGCCGATCA CCACGTCGAT GAGGTCATCA TTGCGATGCC GGGCGTTCCC
GGAAAAACGA TCCGCGACAT TGTGTCCATC TGCGAGCGGG CCGGCGTGCG CGCGAAAATC
ATCCCCGGAC TTGCCGAACT GGTTGATGGA CGCTTTAGTG TCAACCACAT CCGCGATGTG
CAGATCGAAG ACCTGCTCCG ACGCGAACCG ATCAGTACCG ATATGCAGGC GGTGAGCGCG
CTGATCCGTG GACGGCGCGT TATGGTCACC GGCGGCGGCG GTTCGATTGG TAGCGAAATC
TGCCGCCATG TGCTGCGCTA TGAACCGGCT GAACTGATCA TTCTCGGGCA CGGTGAGAAC
AGCGTCTTTG CTATTCACAA CGAGCTGCAA CAATGGCTGA ACAACGGCTG CAATGAATCC
GGCGTTCCGC GCAGCACTAC TCTTCTGCGC ACGGTCATCG CTGATATTCG CTTTACCGAA
CGGATCCATT CCGTATTCGA GCAGTACCGT CCAGAAATTG TGTTCCACGC TGCGGCGCAC
AAGCACGTGC CGCTCATGGA AGCCAACCCG GTCGAGGCGG TCACCAATAA TGTGCTCGGA
ACGCGCAACC TGCTCGATGC CTCGATTGTG ACCGGCGTCG AACGCTTCGT GATGATTTCG
ACCGATAAAG CCGTCAATCC GACCAGCATC ATGGGCAGCA GCAAGCGCGC TGCCGAGTTG
CTGGTGCATC ACGCCGCAAA ACGCAGTGGA CGGGCGTTCA TGGCGGTTCG TTTCGGCAAC
GTCCTCGGCA GTCGCGGCAG TGTGGTATGG ACGTTCAAAC AACAGATTGC AATCGGCGGA
CCGGTGACTG TCACCCATCC TGAAATGCGC CGCTATTTCA TGACCATCCC CGAAGCGGTG
CAACTGGTGT TGCAGGCGGC GGCGCTTGGT CAGGGCGGCG AGGTGTTCAC GCTCGACATG
GGCGAGCCGG TCAAAATCCT CGATCTGGCG CGCGACATGA TCGAACTCTC CGGGTTGCAG
GTCGGGCGCG ACATTGATAT TGCCTTCGTC GGGTTACGCC CAGGCGAAAA ACTCTACGAG
GAACTGTTCC TGCCCGGCGA AACGTATGAT CGCACCGGTC ACGAGAAAAT CTTCATCGCC
CGCCATGCCG GGCGACTCGT GCCGCCCGAT GTGCTAGCGC TCATCGCCGA TCTCGAAGAA
GCGGCGCTCG CGGACGACGC GCACCGCACT CAGCGCTTGC TGCGCCTTAT TGTGGAACGC
AGCCAGGCGA CATCCATCGA GATGCTGCAC ACTGATCGGG CGCAGGCGCG CCCCGATCTG
CGCGCGCTGA CGGCAGGGAG CGTATGA
 
Protein sequence
MARSMPAFHF YGIDALALII AAFLGYALRL ERLDLREYWG AYALFTGITL IVVPSTFAMV 
GVYAQYWRYA SFYEFSLLAV ALAFAGIMLT GLVLLMRALF PHLPVVPLSV PLIFTMPAMA
LTALPRLSVQ ARFRPSSLRH SQHSSNRVLI MGAGEAGAMI AHSLRTARRN TIIVGFVDDN
PGKRGVRING ASVLGNRHDI PRLVADHHVD EVIIAMPGVP GKTIRDIVSI CERAGVRAKI
IPGLAELVDG RFSVNHIRDV QIEDLLRREP ISTDMQAVSA LIRGRRVMVT GGGGSIGSEI
CRHVLRYEPA ELIILGHGEN SVFAIHNELQ QWLNNGCNES GVPRSTTLLR TVIADIRFTE
RIHSVFEQYR PEIVFHAAAH KHVPLMEANP VEAVTNNVLG TRNLLDASIV TGVERFVMIS
TDKAVNPTSI MGSSKRAAEL LVHHAAKRSG RAFMAVRFGN VLGSRGSVVW TFKQQIAIGG
PVTVTHPEMR RYFMTIPEAV QLVLQAAALG QGGEVFTLDM GEPVKILDLA RDMIELSGLQ
VGRDIDIAFV GLRPGEKLYE ELFLPGETYD RTGHEKIFIA RHAGRLVPPD VLALIADLEE
AALADDAHRT QRLLRLIVER SQATSIEMLH TDRAQARPDL RALTAGSV