Gene Information |
Locus tag | Rcas_3887 |
Symbol | |
ID | 5541393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5087102 |
End bp | 5089372 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640895998 |
Product | hypothetical protein |
Protein accession | YP_001433941 |
Protein GI | 156743812 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
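The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the stop codon. Neither convention is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 5087102
end_bp = 5089372
reported_gene_length = 2271      # bp
reported_protein_length = 756    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is an unspliced open reading frame ("Is gene spliced | No")
# whose final codon is a stop codon that is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length, and the gene length is a whole number of codons.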
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGCTC AACAGACGCT GCGCAGTGCG CGGTCCGTGA CGACGATTGA TCTCATCTTG CTGGCTGCTA TTGTTGCGCT GGCGCTGGTC ACACGCCTCT GGTTCTGGCA GGTGCAGGCG CGTTCCGGCG CCGTGCCGCC GGGCGATCCG GAAGAGTACT ATCGCGCTGC CATCCATATG CTGCACGGCG GGTACCACGA TACCGGCAAA TGGCTGCGCC CGCCGGTCTA TCCGGCATTC CTGGCGCTGC TCTTGCCACC GACGAGAATG AATGTCGCCG GAGCGCTGTT GCTTCAGGCG TGTGTTTTAG GCATCGGAAC GCTGGTATTC TATGCCTCCG GCACGCAACT GTTCGGGCGC GTCACAGGAA TGGTGACGAC ATTGCTCGCA GCACTGTTCG TGCCGCTGGC ATCGTATGCG AGTTCGCTTT ATGCCGAAGC GCTGTTTGTG ACACTCCTGG TCATTGGTCT TGCGCTGATC GACCGCGCGC TGGTTCGCAA CAGCACGCGA GCGGCATGTG GCGCCGGCGT GCTTCTGGCG CTGGCGGCAC TGACCCGCGC CGTGGGTCTG TACCTGATCC CGCTAGCAGC CGTGTGGATT GCCTGGCGCA TGCGACACGG CGGTAGCCTG TCGATAGGGA TGGGCTTGTC TGACACGCGC CTGCGTCGCG TTGCCCGTTC CAAAGATCAC GGCGCCCACA TTCACCCTTC TTCCGATGTA GGAGAAGGAG CGTGGAAGGA TACAGGGAAA GACGCCTCGC AGAGTGAACA GCACATTATG AGTGACAAGA GCGACCGCCT GGAGATACGT TCCTATCAAC TGGCAATCTC CCTTATCCTG GGAGCGCTTC TGGTGGTTGG ACCGTGGGCA GCGCGGAATT ATCTGGCGCA CGGGCGCGTC ATCCTCAGCG ATACCAATGG CGGCATCAGT ATGTGGTACG GCACAGTGCG CGACGATGCC GAAGAGAAGG CAGGCGAAGC GCGGCTGGCG GCTGTGCCCA ACCTCGCCGA CCGGCAATCG CTTGCAATTC AGATGGCGTG GGAGAACATT CGTCATGATC CGGCACGGTT CCTGGCGCGC ATGCGTTTCA AGATTGCGTC GCTCTACGCG CTGCAAACAC GCAGTTATGC CGTCGGCGAT GTTATTTCAA TCGACTCGCG TGGTGCGCCA CTGGTTCAGA ATGCAGGCGA ATATCGCCTG AGCATGACGC TTTTGGCGGA CGTGCAGTAC GTGGCGCTCA TAATCCTGGC AATTGGCGGC GTCTGTTTTA TGCCGCACCC TGCCCGTGCC ATTCCGACGT TGCTCTGGGT GGGACTGGCG ACCCTGCTGG CGGTATTGAC CATTGGACAC CCGCGATTGC GCCTTCCGAT TGTTGCGTCT GTCCTGCCCT TCGCTGCGTA TGCGCTGGTC AGATTGCCCG CAGGATGGCG ACACATCCGT CAATTGCCGC GCGACCGGCG CAGTTATATG GCGCTGAGCG GAGTGATGGT TTTCCTGGCG CTGATCGTCA GCATGCGGTA TATTCCGTGG GGTGCGGGTA TGTGGTATGC TGTGCCGGGA CGATCGGCAC TCGAAGCGGG CGATTTGCGA CAGGCTGAAA CGCTGCTGGC GCTGGCGCAC GATGCTCACC CGGATAACCC GTTGCGCGTG ATCGATCTTG CCGATCTGCG GTTGGCGCAG GGCGATGATC GGGCGGCGCT TAGCCTGTAC CGGCGCGCGG CTGAGATGGA ACGTCGCAGC CTGTATGCGC AGGCGATGCG CGCCATCACC GGCGCGTATC TTGCCATGCC CGACGAAGCG 
GCAGCAGGAT TGGCAGCGAT CGATGATTAC TGGCGCTCAG GCAACGATCT GCTCGAATGG GCATGGACCA CACGGCGACG TCCTGCACCG GATCGCGTCG TTCCCGGCGA TCCGATGGCG CTGGGACTGT ATGCCGGGTT TGCGCCCGCC ACGCCTGATC TCGCGGTTGG GCGCTGGACC CTGGGAGAAG GACGGGTGCG GGTGCGTGGC GGCTGCGGCG CCTTAGCGGT TCAGTTGCGC GGACCATCCG GGCGTCGGGT AGACATCAGC ATCGACGACT GGGGTATTCG AAAGCGGATG ATAATGAACG GCGAACAACA GGAGGTGCGC CTTGCGCTCT CCGGCATTCG CGAATGTGAA TTCGGACCCG AACTGACTGT GCATATCGTC AGCGAAACGG GACTGCTCGA TCTGGAGCGG GCGCCATGGT ACACGGGCGT GGCAGTGTAC GAGGTGCGTG TCGAACGGTG A
|
Protein sequence | MQAQQTLRSA RSVTTIDLIL LAAIVALALV TRLWFWQVQA RSGAVPPGDP EEYYRAAIHM LHGGYHDTGK WLRPPVYPAF LALLLPPTRM NVAGALLLQA CVLGIGTLVF YASGTQLFGR VTGMVTTLLA ALFVPLASYA SSLYAEALFV TLLVIGLALI DRALVRNSTR AACGAGVLLA LAALTRAVGL YLIPLAAVWI AWRMRHGGSL SIGMGLSDTR LRRVARSKDH GAHIHPSSDV GEGAWKDTGK DASQSEQHIM SDKSDRLEIR SYQLAISLIL GALLVVGPWA ARNYLAHGRV ILSDTNGGIS MWYGTVRDDA EEKAGEARLA AVPNLADRQS LAIQMAWENI RHDPARFLAR MRFKIASLYA LQTRSYAVGD VISIDSRGAP LVQNAGEYRL SMTLLADVQY VALIILAIGG VCFMPHPARA IPTLLWVGLA TLLAVLTIGH PRLRLPIVAS VLPFAAYALV RLPAGWRHIR QLPRDRRSYM ALSGVMVFLA LIVSMRYIPW GAGMWYAVPG RSALEAGDLR QAETLLALAH DAHPDNPLRV IDLADLRLAQ GDDRAALSLY RRAAEMERRS LYAQAMRAIT GAYLAMPDEA AAGLAAIDDY WRSGNDLLEW AWTTRRRPAP DRVVPGDPMA LGLYAGFAPA TPDLAVGRWT LGEGRVRVRG GCGALAVQLR GPSGRRVDIS IDDWGIRKRM IMNGEQQEVR LALSGIRECE FGPELTVHIV SETGLLDLER APWYTGVAVY EVRVER
|
| |
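The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method the database used: it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and the helper name `translate` is hypothetical. Only the first 30 bases and the final 21 bases of the gene sequence are used here; pasting the full sequence in their place would translate the whole CDS.

```python
from itertools import product

# Codon-to-amino-acid map in the conventional TCAG ordering.
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence listed above.
head = "ATGCAGGCTC AACAGACGCT GCGCAGTGCG"
tail = "GAGGTGCGTG TCGAACGGTG A"

print(translate(head))  # compare with the start of the protein sequence
print(translate(tail))  # compare with the end of the protein sequence
```

The two fragments translate to `MQAQQTLRSA` and `EVRVER`, which match the first ten and last six residues of the protein sequence in the record.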