Gene Information |
Locus tag | Rcas_1360 |
Symbol | |
ID | 5538832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 1738511 |
End bp | 1740442 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640893497 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001431474 |
Protein GI | 156741345 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
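The coordinate and length fields in this record are mutually consistent, and a short check makes the arithmetic explicit. The sketch below is illustrative only: the function names `gene_length` and `protein_length` are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in exactly one stop codon. Both assumptions agree with the values listed here, but the record does not state them.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length_bp = gene_length(1738511, 1740442)
print(length_bp)                  # 1932
print(protein_length(length_bp))  # 643
```

The strand field does not enter the calculation: start and end are given in replicon coordinates, so the span is the same for a gene on either strand.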
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCAAC GACTCAAGAA TCGGCACTTT TTGATCTTCG ATGTGCTGCT CGTGCCGCTG GCAATCTACG TCAGCTTCGT GCTGCGCCTC GAAACGTTCG ATCTCAAGAC GTACTGGGTG GCATGTGCGC ACTTCTGCCT GATGGCAGTC ATCGTCACTC CACTGATATT TCGCGCATTC GGCGTCTATC GCCGCTACTG GCGCTACGCT TCGTTCGAGG AAGTGCTGCT GCTCTGCAGT GCAACCTCGC TGGCAATGGG AGCCACTGCG ATCCTCCTCA CACTGCTCGA CATTGTGACG CCGGTGATAG CGACAGTCCC GCGTTCCATT CCATTCATCG TTCCGCCAAT CGCCGCATCG CTCGTCAGCA TTCCACGATT GCTGGTACGC ATCGGCGCAG CGCGCGAGCG CCGCCGCCGC GCCACCGACC GACCTGCGCC TGTGCTGATC ATGGGGGCCG GCGATGCTGC ATCCATTATT GTGCGAGAGA TTCAGCGCAA CCCCAGGCTC GGCATGGAGG TTGTCGGGTT GCTGGATGAC GATCCGACGA AGCGCGGCTT GCGATTGCAC GGCGTCGAGG TGCTTGGCGA CCGCCATGCC ATTCCATCGC TGGTAGCGCA ACATAAAGTA CGCCAGGTCA TTATCGCCAT GCCGGGCGCG CCAGGGAAAG CGGTGCGCGA CATCATGCAC ATCTGTGAGT CTGTCGGAGT TGCCGTGCGC ATTGTGCCCG GCATGCACGA ACTGATCGAT GGCACGATCA GCGTCAGCAA ACTACGCACC ATTCAAATCG AAGACTTGCT CCGCCGCGCG CCGGTGCAGA CCGACACAGC CGCAGTGCGC GGGCTGGTTG CCGGGCGACG CGTGCTGGTG ACCGGCGGCG GCGGCTCGAT TGGAAGCGAA CTCTGCCGCC AGTTGCTGCG CTTCGGTCCA TCGCATCTCA TCGTTCTCGG ACACGGCGAG AATAGTGTTT TCGAGATCTG CAATGAACTC GACTGCCTGG CGGAAGCGCA CCTTGATCAA TCGCCCTGCA TTGTACCGGT CATCGCCGAT ATTCGCGACC TGGAACGGCT GCGCTCCGTT TTTGCCATCC ACGCACCCGA ACTCGTTTTC CACGCCGCAG CGCACAAACA CGTCCCGCTC ATGGAAGCGC ATCCGGTTGA AGCCATCAGT AACAATGTCG TTGGAACGCG CAATCTGCTT GATGTCGCGC TCGAAACCGG CGTCGAACGT TTTGTGATGA TCTCATCGGA CAAAGCAGTC AATCCGACGA GCGTTATGGG TGCAACCAAG CGCATCGCCG AAATGCTCGT TCTCGACGCC GCGCGGCGCA GCGGTCGTCC TTTTGTGGCG GTACGTTTCG GCAATGTGCT CGGCAGTCGC GGGAGTGTCG TGCTGACGTT CAAGCGCCAG ATTGCAGCAG GGGGACCGGT GACCGTCACC CATCCAGAAA TGCGGCGCTA TTTCATGACC ATTCCCGAAG CCGTGCAACT CGTGCTTCAG GCATCGGTGC TGGGACGCAC GGGTGAGATC TTTATGCTCG ACATGGGCGA ACCGGTGAAG GTAGTCGATC TGGCGCGCGA CATGATCCGT CTATCGGGGC TGGAAGTCGG GCGCGACATC GACATCTGCT TTACGGGGAT GCGCCCCGGC GAGAAACTGT TTGAAGAACT GTTCGCCCGT GGCGAAGAGT ATCAGCCGAC GGCGCACAGC AAAATCTTCA TCGCGGCAGG CGCCAGCAAC AACATTCCGC TCGCGCTACG CACCGACGTG ACTTCACTCG AACGGACGGC ATGCACCGGC 
AACAATGCCG CCATCCGCCG TCTGCTGCGC GACATTGTAC CGGAATACTG CCCACCGGAG TTTCTGCCGC CTGTACCGGT CAATGACAGA ACCACGCAGC CCGTGCGGAT TCGTCCGTTG CAACCGGCAT GA
|
Protein sequence | MMQRLKNRHF LIFDVLLVPL AIYVSFVLRL ETFDLKTYWV ACAHFCLMAV IVTPLIFRAF GVYRRYWRYA SFEEVLLLCS ATSLAMGATA ILLTLLDIVT PVIATVPRSI PFIVPPIAAS LVSIPRLLVR IGAARERRRR ATDRPAPVLI MGAGDAASII VREIQRNPRL GMEVVGLLDD DPTKRGLRLH GVEVLGDRHA IPSLVAQHKV RQVIIAMPGA PGKAVRDIMH ICESVGVAVR IVPGMHELID GTISVSKLRT IQIEDLLRRA PVQTDTAAVR GLVAGRRVLV TGGGGSIGSE LCRQLLRFGP SHLIVLGHGE NSVFEICNEL DCLAEAHLDQ SPCIVPVIAD IRDLERLRSV FAIHAPELVF HAAAHKHVPL MEAHPVEAIS NNVVGTRNLL DVALETGVER FVMISSDKAV NPTSVMGATK RIAEMLVLDA ARRSGRPFVA VRFGNVLGSR GSVVLTFKRQ IAAGGPVTVT HPEMRRYFMT IPEAVQLVLQ ASVLGRTGEI FMLDMGEPVK VVDLARDMIR LSGLEVGRDI DICFTGMRPG EKLFEELFAR GEEYQPTAHS KIFIAAGASN NIPLALRTDV TSLERTACTG NNAAIRRLLR DIVPEYCPPE FLPPVPVNDR TTQPVRIRPL QPA
|
| |
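The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so it can be translated directly and compared with the protein sequence. The following is a minimal sketch, not the database's own method: the helper names `clean_sequence`, `gc_content` and `translate` are hypothetical, and the codon table is the standard amino-acid assignment that NCBI translation table 11 shares with table 1. Alternative start codons permitted by table 11 are not handled, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignment in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the gene sequence, copied from the record.
first_blocks = "ATGATGCAAC GACTCAAGAA TCGGCACTTT"
last_blocks = "CAACCGGCAT GA"
print(translate(first_blocks))  # MMQRLKNRHF
print(translate(last_blocks))   # QPA
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.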