Gene Rcas_1360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1360 
Symbol 
ID5538832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1738511 
End bp1740442 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content60% 
IMG OID640893497 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_001431474 
Protein GI156741345 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCAAC GACTCAAGAA TCGGCACTTT TTGATCTTCG ATGTGCTGCT CGTGCCGCTG 
GCAATCTACG TCAGCTTCGT GCTGCGCCTC GAAACGTTCG ATCTCAAGAC GTACTGGGTG
GCATGTGCGC ACTTCTGCCT GATGGCAGTC ATCGTCACTC CACTGATATT TCGCGCATTC
GGCGTCTATC GCCGCTACTG GCGCTACGCT TCGTTCGAGG AAGTGCTGCT GCTCTGCAGT
GCAACCTCGC TGGCAATGGG AGCCACTGCG ATCCTCCTCA CACTGCTCGA CATTGTGACG
CCGGTGATAG CGACAGTCCC GCGTTCCATT CCATTCATCG TTCCGCCAAT CGCCGCATCG
CTCGTCAGCA TTCCACGATT GCTGGTACGC ATCGGCGCAG CGCGCGAGCG CCGCCGCCGC
GCCACCGACC GACCTGCGCC TGTGCTGATC ATGGGGGCCG GCGATGCTGC ATCCATTATT
GTGCGAGAGA TTCAGCGCAA CCCCAGGCTC GGCATGGAGG TTGTCGGGTT GCTGGATGAC
GATCCGACGA AGCGCGGCTT GCGATTGCAC GGCGTCGAGG TGCTTGGCGA CCGCCATGCC
ATTCCATCGC TGGTAGCGCA ACATAAAGTA CGCCAGGTCA TTATCGCCAT GCCGGGCGCG
CCAGGGAAAG CGGTGCGCGA CATCATGCAC ATCTGTGAGT CTGTCGGAGT TGCCGTGCGC
ATTGTGCCCG GCATGCACGA ACTGATCGAT GGCACGATCA GCGTCAGCAA ACTACGCACC
ATTCAAATCG AAGACTTGCT CCGCCGCGCG CCGGTGCAGA CCGACACAGC CGCAGTGCGC
GGGCTGGTTG CCGGGCGACG CGTGCTGGTG ACCGGCGGCG GCGGCTCGAT TGGAAGCGAA
CTCTGCCGCC AGTTGCTGCG CTTCGGTCCA TCGCATCTCA TCGTTCTCGG ACACGGCGAG
AATAGTGTTT TCGAGATCTG CAATGAACTC GACTGCCTGG CGGAAGCGCA CCTTGATCAA
TCGCCCTGCA TTGTACCGGT CATCGCCGAT ATTCGCGACC TGGAACGGCT GCGCTCCGTT
TTTGCCATCC ACGCACCCGA ACTCGTTTTC CACGCCGCAG CGCACAAACA CGTCCCGCTC
ATGGAAGCGC ATCCGGTTGA AGCCATCAGT AACAATGTCG TTGGAACGCG CAATCTGCTT
GATGTCGCGC TCGAAACCGG CGTCGAACGT TTTGTGATGA TCTCATCGGA CAAAGCAGTC
AATCCGACGA GCGTTATGGG TGCAACCAAG CGCATCGCCG AAATGCTCGT TCTCGACGCC
GCGCGGCGCA GCGGTCGTCC TTTTGTGGCG GTACGTTTCG GCAATGTGCT CGGCAGTCGC
GGGAGTGTCG TGCTGACGTT CAAGCGCCAG ATTGCAGCAG GGGGACCGGT GACCGTCACC
CATCCAGAAA TGCGGCGCTA TTTCATGACC ATTCCCGAAG CCGTGCAACT CGTGCTTCAG
GCATCGGTGC TGGGACGCAC GGGTGAGATC TTTATGCTCG ACATGGGCGA ACCGGTGAAG
GTAGTCGATC TGGCGCGCGA CATGATCCGT CTATCGGGGC TGGAAGTCGG GCGCGACATC
GACATCTGCT TTACGGGGAT GCGCCCCGGC GAGAAACTGT TTGAAGAACT GTTCGCCCGT
GGCGAAGAGT ATCAGCCGAC GGCGCACAGC AAAATCTTCA TCGCGGCAGG CGCCAGCAAC
AACATTCCGC TCGCGCTACG CACCGACGTG ACTTCACTCG AACGGACGGC ATGCACCGGC
AACAATGCCG CCATCCGCCG TCTGCTGCGC GACATTGTAC CGGAATACTG CCCACCGGAG
TTTCTGCCGC CTGTACCGGT CAATGACAGA ACCACGCAGC CCGTGCGGAT TCGTCCGTTG
CAACCGGCAT GA
 
Protein sequence
MMQRLKNRHF LIFDVLLVPL AIYVSFVLRL ETFDLKTYWV ACAHFCLMAV IVTPLIFRAF 
GVYRRYWRYA SFEEVLLLCS ATSLAMGATA ILLTLLDIVT PVIATVPRSI PFIVPPIAAS
LVSIPRLLVR IGAARERRRR ATDRPAPVLI MGAGDAASII VREIQRNPRL GMEVVGLLDD
DPTKRGLRLH GVEVLGDRHA IPSLVAQHKV RQVIIAMPGA PGKAVRDIMH ICESVGVAVR
IVPGMHELID GTISVSKLRT IQIEDLLRRA PVQTDTAAVR GLVAGRRVLV TGGGGSIGSE
LCRQLLRFGP SHLIVLGHGE NSVFEICNEL DCLAEAHLDQ SPCIVPVIAD IRDLERLRSV
FAIHAPELVF HAAAHKHVPL MEAHPVEAIS NNVVGTRNLL DVALETGVER FVMISSDKAV
NPTSVMGATK RIAEMLVLDA ARRSGRPFVA VRFGNVLGSR GSVVLTFKRQ IAAGGPVTVT
HPEMRRYFMT IPEAVQLVLQ ASVLGRTGEI FMLDMGEPVK VVDLARDMIR LSGLEVGRDI
DICFTGMRPG EKLFEELFAR GEEYQPTAHS KIFIAAGASN NIPLALRTDV TSLERTACTG
NNAAIRRLLR DIVPEYCPPE FLPPVPVNDR TTQPVRIRPL QPA