Gene Information |
Locus tag | RoseRS_4007 |
Symbol | |
ID | 5210990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 5012044 |
End bp | 5014974 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640597596 |
Product | hypothetical protein |
Protein accession | YP_001278302 |
Protein GI | 148658097 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
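The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes one terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for locus tag RoseRS_4007.
start_bp, end_bp = 5012044, 5014974
length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # matches "Gene Length | 2931 bp"
print(protein_length(length_bp))  # matches "Protein Length | 976 aa"
```

Under those assumptions, the computed values agree with the Gene Length and Protein Length fields listed in the record.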
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.109226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCG TTGCACTCCG TCCCTCGCCT TCAAACCCGC CGCCGGCGAC CCGTCCGCCG CGTCCCGACC TGCCGTTCCA TCTCGTGCGC GCGCTGGCGT TGATCTGCAT TGTGCTGGCG CTTGTCCAAC CGCTGGCGCT GCCGCAGGTT GCCGACTCGC CTTCATTCCA GGCGATCCTG TTACCCCGTG TCTTGCACGA GCCAGGAGTC GTTGCGCCAG CATTGCTGCG GCGTGCATTC GAGCCGCCGC TGTTCCCGAT GATCAGCGCC GTGGCAGTGG CGTCGCTGTG CATCCTTGCG GGGGAACTCA GCGTGGCGCT GATCGCGGCG ATCCTGGCGC GCCACGCTGC ACGTCGCGCT ACGCGCTCGA CGATCTGCCT GCGGGTTCGC CCCGCGCATA CCCTCGCCGG GACGAGCGCA TCGGTCGCCA GACCAGGCGC GCTTATGCGT ATGATTCATG GCGGCATGGC GAAGCGTTTC TGGATGCATC CTGCTCCACC GTATACCCTG ATCATCAGCG GCGCGCCAGA TCTCCCCGCA GAACTGGGGG CGCTGATCAG CGGCACGCCA GACGATCAAC AACGCGCTAT GGGTGTGCTG GACGGCGCAG TACGCAGCAG CGCTCCGGGC GCACAGATCG ATGCCACCGA TGATCCGCTC ATTGCAGCGG CAACTCCTGG ACGCTGGATC GCCTGGCAGC GCTTCGGTCT GGCGCTGCCG CCAGCCTATC CGCTCCATAC ACCGGCGCTC GTTGTTGAAC ATGAACTGGC AGGAGTATTG CTGGCGGCAG TCCGACCCCA CGGCAGCGTA GTGCACGCCG GGCTGGAGAT CGCCCTGCGC CCGCAGGGTG GCATGACCGG ATGGGCGCTG GGGAGGCAAT GGCGCGCACG TGCGATGGCG TTGAAACTGA CGCTTGAACA GCGCCAGGAT TATGCGCTTG CGCCGGATAT TGCCGCCATC GAGGCAAAAC TGGGCGATGC CGCATTCGAG GCGACCATCG TGGCAACCGC CGTCGCCGGG CAGCGTGATG ACGCAATCAC TGCGCTCCGC GCCATCGGTG ATGCGCTTGG CGCATTCCAG CAACGCACCG CAAGCCGCTT GCAGCGTCTC GTCCCGCTTG CCCGTGCGTC GGTCCGTCAG ATAACCGTGG GTCAGGAGAT GCAGATCGTA TCGCACCTGC GCACGCCGCC GGTCTCACAT CCCCCCACGC TGCTACTGCC GTTCCGTCTC TGGCGTGGAC CGGACATTTT GAGCGCCGGA GAAGTGGGGT ATCTGTGGAA TCTTTCCAAT CCTCAGTCGA ACGGATTGAT GCGCTGTGAT CCGTGCCGCC GGATTGCAGC GCCGCCTCAC GCCTTCTGCG CCGACGATCC GGAGCGTATC ATCGTCGGAT ACGCCGCGCA TACCGATGGA CACAGCGCAC CGGTTGGACC GACGCTGCGC GATCTGCGTC AGATTCTGCA CCTGACCGCC GGTATGGGCG CCGGAAAGAG TCGTTTGCTG GCGAACCTGT GTCAGCAACT GATCCCACAC GGATTCATGC TCATCGACGG CAAAGGCGAT GATCGTGGCG GTAGTCTGGT TGATGTCGTG CGTCAACTGA TTCCGACGGT GGACGAAGGA CGGTTCATCC TGCTCGATCC GCTCGACACG GCATGGCCCA TCGGACTGAA CCCGCTCGCG GGCGCCGATG TCACGCAACC GGGCGGCGCC GATCTGGCGC TGGGGCAAAT CCTGGCAACC TTCGCGCGCA TCGATCCTGA TACGTGGGCG CGATCGCCGG GCATGCAGCA GTTCGCGCGT 
ATGGCAGCGC TGCTGGTGCT GGAGGGCGAA ACGCATCCCA CACTGGCGCA CGTCAAACAG GCGCTGATCG ACGAAGCGTA CCGCCAGGAA CTGCTCCGGT CGGCGCGGAA CATCGAGGTC GCCAGTTTCT GGCTTGAGAC ATACCCGCGC CTGGGTGAGG GGCAGCGATC CAGTTGCGAC GCGCTGTTGA GGCGTTTCGA TGCGCTGTTG ACCGCTGAAA CGACGCGCTA TCTGGTGGCG CAGGCACGCC CCACGCTCGA TCTGGCGCAG ATGATTGCCG ACCGGATGAT CGTGCTGGTT CCGCTTCCCG ATGTGGCGCT TGGCGGGCTG GCAGGCGCGG TCGGCATGCT GATCGCCCAG GCATTTGTGC GTGCCGCCTT CAGCCGTAGC GGCAGCGACC GCACCCGCCA CGATTATCCG TTGATCATCG ATGAACTCCA GGTGCTGATC GGCAACAGCG ACACAACCGA TATAGCGACT GCCATCACGC GCCTGCGGTC GCTCGGCATT CCGACGATCT ACGCCCATCA GGCGCTGGCG CAGTTGGGTG ATCTCCGCGA CCTGATGCTG ATCAATGCCG GGAACCGCAT CATGCTGCAA ACCCAGGAAC CCGATGCCAG CATCTATGCG CGCGCCTACG GCGCCAGCGG TCTCACCGCC GCCGATCTGA GCGGGCAACC GCCGAACGAC CATCAGTACG CCGTCCTGCG CTGCCGGGGC GTGGTGGCCG GACCTTTTTC GATGCAACCG CTCCCCTGGC CCGCTGTGCA CGACGAGTCG CCGCCGCCAT ACCAGGGACC GGCGTGGCGC GACGTTCTGC CCGACGACGC CGATCCGGCG GACCGGTTCA TTGCGCGCGT CGTCTACGAT GCCGGTACAG GCGCAACTGC CGCCAATGAA CTGGCGCACC TCAGCGACGC CGACTGGCAG CGACTTCTGC GGCGGTGGGA GCGGATCAGG CACGTTCAAC GGCAGCACAT CCTTGCGCAT CCCGGTTGTA TCGCCGACCG ACTGGAACGT CAACGCTGGC TATCGCGTCT CTATGCGGCG CGTCCGCGGG TGCTGGCTGC CGCCGAATAC CTGCGCGGGC GTCAGAAAAG ATCGCATCAG CACGCCTTAA ACTCGACGTG A
|
Protein sequence | MSRVALRPSP SNPPPATRPP RPDLPFHLVR ALALICIVLA LVQPLALPQV ADSPSFQAIL LPRVLHEPGV VAPALLRRAF EPPLFPMISA VAVASLCILA GELSVALIAA ILARHAARRA TRSTICLRVR PAHTLAGTSA SVARPGALMR MIHGGMAKRF WMHPAPPYTL IISGAPDLPA ELGALISGTP DDQQRAMGVL DGAVRSSAPG AQIDATDDPL IAAATPGRWI AWQRFGLALP PAYPLHTPAL VVEHELAGVL LAAVRPHGSV VHAGLEIALR PQGGMTGWAL GRQWRARAMA LKLTLEQRQD YALAPDIAAI EAKLGDAAFE ATIVATAVAG QRDDAITALR AIGDALGAFQ QRTASRLQRL VPLARASVRQ ITVGQEMQIV SHLRTPPVSH PPTLLLPFRL WRGPDILSAG EVGYLWNLSN PQSNGLMRCD PCRRIAAPPH AFCADDPERI IVGYAAHTDG HSAPVGPTLR DLRQILHLTA GMGAGKSRLL ANLCQQLIPH GFMLIDGKGD DRGGSLVDVV RQLIPTVDEG RFILLDPLDT AWPIGLNPLA GADVTQPGGA DLALGQILAT FARIDPDTWA RSPGMQQFAR MAALLVLEGE THPTLAHVKQ ALIDEAYRQE LLRSARNIEV ASFWLETYPR LGEGQRSSCD ALLRRFDALL TAETTRYLVA QARPTLDLAQ MIADRMIVLV PLPDVALGGL AGAVGMLIAQ AFVRAAFSRS GSDRTRHDYP LIIDELQVLI GNSDTTDIAT AITRLRSLGI PTIYAHQALA QLGDLRDLML INAGNRIMLQ TQEPDASIYA RAYGASGLTA ADLSGQPPND HQYAVLRCRG VVAGPFSMQP LPWPAVHDES PPPYQGPAWR DVLPDDADPA DRFIARVVYD AGTGATAANE LAHLSDADWQ RLLRRWERIR HVQRQHILAH PGCIADRLER QRWLSRLYAA RPRVLAAAEY LRGRQKRSHQ HALNST
|
| |