Gene Information |
Locus tag | Rcas_4356 |
Symbol | |
ID | 5541869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5607920 |
End bp | 5610841 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640896462 |
Product | hypothetical protein |
Protein accession | YP_001434398 |
Protein GI | 156744269 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0703153 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACAA CATCCTCTCC CGGTTCGTCA CAACCTGTCG TCTCCAGTCT CTCGCTGAGG CGCTGGAGCA AATGGTGGTC CCCCGGATCA CTCATCGGCG TGTTGCGCGT CGTCAGCCGA CGGTTGTGGA GCCACCTGGG GCTGATGCTC GCCATCGCCG TCGGCTTCAT CGTCGCTATC GGGTTGACGG TCAGTATTCC GGTGTATGCC GAGGCGGTCG GCTATCGTAT CCTGCGTGAT GAACTGGCTC AGGGGGAAGC AGGCTCCAAA CGACCGCCGT TCGCCTTTAT GTTTCGTTAC CTCGGCTCGC AAACCGGCGT CATTTCCTGG CGCGACTATG CCCCGCTCGA TGAGTATATG CGCACCCAAC TGGCGGAACG CCTTGGTCTG CCGATTATGA CAGAGGTGCG CTACGTCGCC ACGGACAAAG CGCCCCTGAT GCCCGCCGGC GGCGTCGGGA AGCCGCTGAT CTTTGTCAAT ACCGCCTTCG CCACCGATTT TGAGCAGCAC ATTGATATTA TCGATGGTGC ATTTCCACAA CCGGCATCGA CGGACGGACC GATTGAGGTG TTGATTTCGG AAGATCTGTC GGGACGGCTT GGCTTTCAGG TTGGAGAGGA GTACCTCATC CTTGGTCCAC AGGAACAGCG CGTCGATCAG AGTTACCCGA TCCGTATCGC AGGAGTCTGG CGCGTGCGTG ATTCCGGCAG CGACTACTGG TTCTACGATC CGGTGACTCT CTACGATACA CTCTTTGTGC CGGAAGAGAG TTTTCGTGAC CGTCTGACAG CGATTAATCC AAAACCGATC TATGTTGCTA CGTGGTATGC GATTGCCGAC GGGAGCAGTG TGCGTTCTTC GGATGTCGCT GATGTGCGCG CGCGCATCAA TCGCATCGCC ACCGATATCA CCACCATTTT GCCCGGATCG CGCATCGACA TCTCGCCAGC GCAGGCATTG GCAGAGCATC AGCGCCAGGT ACAGCGGCTG ACCCTCATTC TGACGATCTT TAGCGTTCCG GTGCTTGGTC TGATTGCGTA TTTCATTCTG CTGGTCGCCG GTTTGGTCGT GCAGCGCCAG AGCAACGAGA TTGCTGTATT GCGCAGCCGT GGCGCTTCGC GCCTTCAGGT GCTCGGTATC TACCTGATCG AGGGCTTGCT GCTCGGCATC GCCGCACTGG CGGCGGGGGT CGCCGTGGGG CAGGGCGCCG GGTTGCTGAT GACCTGGACG CGCTCGTTCC TCGATGTCCA ACCGGGTGAG TGGTTGCCGA TTGAACTTAC TCCCGACGCC TGGCAACGCG CCTGGCAAAT GCTGATAGTG ATGGTGCCTG CAAGCCTGTT GCCTGCGTTC GGAGTCGCCC GCTATACGAT TGTGTCGTTC AAGAGCGAAC GCGCGCGGGC GACGCGCAAA CCGTTCTGGC AGCGCATGTA CCTCGATCTG CTCTTGCTGC TGCCGGTCTA TTATGGTTAT ACGCTCCTGG AACAGCGTGG CACAGTGGCA TTCCTTGGCG CGGCTGATGA TCCGTTTGGC AATCCGCTGT TACTGCTGGC GCCGACCCTC TACATGTTCA CGCTGGCGCT GGTGGCGACG CGCATCTTTC CGCTGGCGAT GAGCGCACTC GAGTGGCTGG CGCGGCATAT GAGCGGTGTG GCGACGGTGA CGGCGCTCCG GTATCTGGCG CGTACTCCCG GCGCCTACAC CGGTCCGGTG TTGCTGCTGA TCCTCACCCT CAGCCTTGCA ACATTTACCG CGTCTATGGC GCAAACGCTC GACCGCCATC TGATCGATCA GGTGTACTAC 
GAATCCGGCA GCGACATTCG GTTGTACGAT CTGGGACAGA GCGGCGGTTT CTCCGGTCCT ATGGCGGGGA TGCAACCGCA ACAGCCGCTG GTGTCCGATG GCATGCAGGA GGCGCGCTTC ATGTTCCTGC CGGTCACCGA TTACCTGACC ATCCCCGGCG TCGAGGCGGC GACGCGCGTA TCGGTCAGTC AGGTTGAGAT CACTCTGGCC AATCGCACCA TCCCTGCTCG TTTCATCGGT GTTGACCGGG TGGACCTGCC CGCCGTCATT CACTGGCGCG CCGATTATGC CGGTGAGTCA CTCGGCGCGC TGATGAACCG TCTGGCGGAC GACCCTTCCG CCGTGCTGGT CAATAGCGCG TTTGCGGCGC AAAACCGTCT GCGTCCCGGC GACCGCTTTG AGGTGGTCAT GAACGACCTC GACCGGAAGG TTCGTGTGCC GGTGATCGTT GTCGGGTATG TGAACCTCTT CCCGACGGTC TATCCAACTG ATGGTCCGTT CTTGATCGGA AATCTCGACT ATGCCTTCGA TATGCAGGGC GGGCAATATC CCTACGATGT CTGGCTCCGG TTGGCGCCGG GTGTTGAGCG CCAGACGATT GACGAAGGGC TGCGCGAACT GGGGTTGCGC ACCTTCGAGC GTGGCTTTGC GCCGACGATC ATCGTCGCCG AGCATGCGCG CCCCGAACGA CAGGGTTTCT ACGGCTTGCT GTCAGTCGGG TTCATCGCCT CGGCATTTCT GACGGTGCTG GGGTTCTTGT TCTATTCAGC GCTCTCATTC CAGCGCCGGT TCGTCGAACT CGGTATGCTG CGCGCCATCG GGCTTTCGAC CCGGCAACTC GGCGCGTTGC TGGCGTGGGA GCAGGCGCTG ATTATCGGCG CCGGCATGAT TGGCGGCACG CTGATCGGCG TCACTGCCAG TCAGTTGTTT ATCCCGTTTT TACAGGTCCG TCGTGGCGCC AACGCGCAAA TCCCGCCATT TGTCGTCCAG ATCGCGTGGG AGCAGATCGC CATTATCTAC ATGGTCTTTG GCGCGATGCT GATTGCGGCT GTGCTGATTA CCATTGCTCT GCTCCGGCGC ATGAAACTGT TCCAGGCAGT CAAATTGGGA GAAGCGATCT GA
|
Protein sequence | MATTSSPGSS QPVVSSLSLR RWSKWWSPGS LIGVLRVVSR RLWSHLGLML AIAVGFIVAI GLTVSIPVYA EAVGYRILRD ELAQGEAGSK RPPFAFMFRY LGSQTGVISW RDYAPLDEYM RTQLAERLGL PIMTEVRYVA TDKAPLMPAG GVGKPLIFVN TAFATDFEQH IDIIDGAFPQ PASTDGPIEV LISEDLSGRL GFQVGEEYLI LGPQEQRVDQ SYPIRIAGVW RVRDSGSDYW FYDPVTLYDT LFVPEESFRD RLTAINPKPI YVATWYAIAD GSSVRSSDVA DVRARINRIA TDITTILPGS RIDISPAQAL AEHQRQVQRL TLILTIFSVP VLGLIAYFIL LVAGLVVQRQ SNEIAVLRSR GASRLQVLGI YLIEGLLLGI AALAAGVAVG QGAGLLMTWT RSFLDVQPGE WLPIELTPDA WQRAWQMLIV MVPASLLPAF GVARYTIVSF KSERARATRK PFWQRMYLDL LLLLPVYYGY TLLEQRGTVA FLGAADDPFG NPLLLLAPTL YMFTLALVAT RIFPLAMSAL EWLARHMSGV ATVTALRYLA RTPGAYTGPV LLLILTLSLA TFTASMAQTL DRHLIDQVYY ESGSDIRLYD LGQSGGFSGP MAGMQPQQPL VSDGMQEARF MFLPVTDYLT IPGVEAATRV SVSQVEITLA NRTIPARFIG VDRVDLPAVI HWRADYAGES LGALMNRLAD DPSAVLVNSA FAAQNRLRPG DRFEVVMNDL DRKVRVPVIV VGYVNLFPTV YPTDGPFLIG NLDYAFDMQG GQYPYDVWLR LAPGVERQTI DEGLRELGLR TFERGFAPTI IVAEHARPER QGFYGLLSVG FIASAFLTVL GFLFYSALSF QRRFVELGML RAIGLSTRQL GALLAWEQAL IIGAGMIGGT LIGVTASQLF IPFLQVRRGA NAQIPPFVVQ IAWEQIAIIY MVFGAMLIAA VLITIALLRR MKLFQAVKLG EAI
|
| |
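The coordinate, length, and sequence fields above can be cross-checked against each other. The following is a minimal Python sketch of such a check, not part of the source record: the function names are illustrative, and it assumes the start and end coordinates are 1-based and inclusive and that the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code, so the standard codon table is used here.

```python
# Illustrative consistency checks for the record above (names are hypothetical).

BASES = "TCAG"
# Standard codon-to-amino-acid assignments in TCAG order (shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' = stop)."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


if __name__ == "__main__":
    length = gene_length(5607920, 5610841)
    print(length, protein_length(length))
    # First 30 bases and last 9 bases of the gene sequence in the record.
    print(translate("ATGGCAACAA CATCCTCTCC CGGTTCGTCA"))
    print(translate("GCGATCTGA"))
```

With the values from this record, the computed lengths agree with the Gene Length (2922 bp) and Protein Length (973 aa) fields, and the translated ends match the start (MATTSSPGSS) and end (AI, followed by a stop codon) of the listed protein sequence.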