Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3820 |
Symbol | |
ID | 5210802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4779003 |
End bp | 4781240 |
Gene Length | 2238 bp |
Protein Length | 745 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640597416 |
Product | hypothetical protein |
Protein accession | YP_001278124 |
Protein GI | 148657919 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.081896 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00152117 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATCTGA TGATATTGTT TGCGCGTCTG TTTTCCTGGG CGCTGGTGGT CACGCTGATG ATCGGAAGCA TGGGTGTGGC GCATGCGCAG AGTACTGAGC CGCCACAACC GGATGATCAA AATGTTGTTG CGTTTGAACG CCTGGGAGTA ACCGATCAGA CGCTGAGCGG CGTCTTTGAC GGAACACGTT ATCTGTTCAA CATTCCTGCG AACTGGCGAC TGGCATCGGG GGCGCAGGCG CAACTCGATC TGTCAGTTTT CTTTCCGGTG GGCAGTGCAC AGCAACGGTT GGGGGGATTT CTGGAAGCGC GCTTCAATCG GGTGTTGATC GGCACGGTTG AACTGACCCA ACCGGGTGAC CGGCGTGTTG TCTTTGATAT TCCTGATCGG GCGCTCACAC CGGTACGCAG CGATGGTCGT CATGAGTTTG AAGTGGCGTT GGACAATCCA ACGGGGTGTG ATGTTGCTCC AAGTGAGCGC ACGGCAGTTG TCATCCGTTC GACATCGCGC TTTGTTCTGC CGCACACGCT GGCGCCGCTC GATACCGATC TGCGCAATCT GCCGCGCCCC ATTTTTCAGG GTTCATTCGA ACCCGATCAG GCGACGATCG TCATCCCTGA TACTCCGTCG ATCAGCGATC TTCAGGCAGC GCTGACCGTT GCCGCCAGTT TTGGGCGGTT GACCGAAGGG CGTTTGCAGA TCGATCTGAC GACCGTGCAG CGGCTGTCGC CACAGGCGCG CACCGGCCGT CATCTCATTC TGGTTGGCAG TCACACGGGG CTTGCACCGC TGGCGCGCAA TCTGGATCTG CCCGCAACGC TACGGGAGAA TGGTTTTGCT GCTCCCGGCG CCACACCCGA CGACGGCATT CTCCAGATGA TCGTATCGCC CTGGAACTCG GAGCGTGTCG TTCTGGTGGT GAGCGGTGCA TCGGAGGCGG CGGTTGTGAA GGCGGCTCGC GCGTTGAGCG CCGTGCCTGT GCGGATTAAC AATCGCCCGA ATGTCGCGGT GGTTCGTGAC CTGCCGGAGG CGCCAGCGGA TGTGGCGCTG GCAATCGATC AACGCCTGAG CGATCTGGGA TTGGAACCAC GCGTCATTCG GGATCGCACG GGAACATTCG ATCTGAGGTT CACCCTTCCG CCGGGGCAGC AAATCGATGA GGGAGCGTAC TTTGATCTGA CATTCAACCA TGCCGCAACG GTCGATTTCG GTCAGTCAAG TCTCTCGGTC GGGTTGAACG GCATCCCGAT TGGCAGCGTG CGCTTTAGCG ACGAAACGAC GCGCGTGACA ACCGGTCGGA TCACTATTCC GCCTTCCGCA ACCCGTTCTG GTGCGAATGT GCTAACAATT CAGACGAACC TTGTGCCGCG CTCGTTGTGC ACCGATGTTC GCAATACCGA CCTGTGGGTG ACGATCTGGC CCGAGTCGGC GCTGCATCTG CCGATGAAAC CGGTGACTGC CGAACCGCGT CGCACCTTCA ATCTCAGCAG TTACCCGCTG CCATTTACCC TGAATCCATC GCTCGCCACG ACTGCATTCG TCGTTCCACA ACGCGCTCCT GCCGCCTGGA ATGCCGCTGC TTTACTGGCC TTCCAGATGG GACGGCAGAC CCGTGATGCC ATCCTGCAAC CGCTTGCAGT TTTCGCCGAC AATGTACCGA CGGATGTGCG CGAATCGTAT CATCTGCTGG TGATCGGACG ACCGGGCACC CTGCCGATTC TGGTCGAACT CGGCGATGCG CTGCCAGCGC CGTTCGATGC TGGCAGCGAT GTGCCGCGAT CAGTTGATAC GCCCGTTGTG TATCGCGTGC CGCCTGACGC CAGCGTCGCC TATCTGCAAA TGGTCGCTGC GCCGTGGAAT CCAGAGCGGG TGGTTGTTGC TGCGCTGGGG AGCGATGATT CGGGTATTGA ACAGGCGACA ACTATGCTGA TCGATCCGCG TCAACGTGCG CGTCTGACCG GGACGCTGGC GATTGTCGAT CCGCAGCAGC GGGTGACCCT GGGCAACGGT CGCGCGGTTC TCACGGGATC TGCGCCGACG CCGGTTGCGA CAGTCACGCC GGTTGCCGTT CAACCGACGC AACCGACGCC GCAGACGGTT CAACCAACGC CTGCACCCTC CAGGAATGCG TCGTCGAACA ACTCGTGGCT TGTGCCGGTT GTTATTATTG TTGCTGTCAT CGGTGCGAGC GCGTTGATCC TTTGGCGCGC ACCATGGCGA CGTCCGCCAG GAACCTGA
|
Protein sequence | MHLMILFARL FSWALVVTLM IGSMGVAHAQ STEPPQPDDQ NVVAFERLGV TDQTLSGVFD GTRYLFNIPA NWRLASGAQA QLDLSVFFPV GSAQQRLGGF LEARFNRVLI GTVELTQPGD RRVVFDIPDR ALTPVRSDGR HEFEVALDNP TGCDVAPSER TAVVIRSTSR FVLPHTLAPL DTDLRNLPRP IFQGSFEPDQ ATIVIPDTPS ISDLQAALTV AASFGRLTEG RLQIDLTTVQ RLSPQARTGR HLILVGSHTG LAPLARNLDL PATLRENGFA APGATPDDGI LQMIVSPWNS ERVVLVVSGA SEAAVVKAAR ALSAVPVRIN NRPNVAVVRD LPEAPADVAL AIDQRLSDLG LEPRVIRDRT GTFDLRFTLP PGQQIDEGAY FDLTFNHAAT VDFGQSSLSV GLNGIPIGSV RFSDETTRVT TGRITIPPSA TRSGANVLTI QTNLVPRSLC TDVRNTDLWV TIWPESALHL PMKPVTAEPR RTFNLSSYPL PFTLNPSLAT TAFVVPQRAP AAWNAAALLA FQMGRQTRDA ILQPLAVFAD NVPTDVRESY HLLVIGRPGT LPILVELGDA LPAPFDAGSD VPRSVDTPVV YRVPPDASVA YLQMVAAPWN PERVVVAALG SDDSGIEQAT TMLIDPRQRA RLTGTLAIVD PQQRVTLGNG RAVLTGSAPT PVATVTPVAV QPTQPTPQTV QPTPAPSRNA SSNNSWLVPV VIIVAVIGAS ALILWRAPWR RPPGT
|
| |