Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2478 |
Symbol | |
ID | 5539959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3187446 |
End bp | 3190577 |
Gene Length | 3132 bp |
Protein Length | 1043 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640894608 |
Product | hypothetical protein |
Protein accession | YP_001432576 |
Protein GI | 156742447 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTCA TTGCCGAACT GAGCCTCATT CTGATTGCAG CAGCGGTGTT GTGTTGCTAC GTCGGATGGG GTGCGGCGCG GTTGGCGTTG CCACCGTCGC TTGCATCATT CCGCGCGCCA TTGACGCCGC TGATCGGCTA TGTCGTCCTG CTCTGGAGCG GGTTCATGCT GGCGAGCCTG GTGCTCAACT TGCGTTGGAC AGTGGCAGTC ATCCTGATCG GCGCCACCGT TCTGAACATT CTCACCTGGC GTGCAGAGGG GCCGCCACAA CCGCTGGCAT GGTTGCGTGC CCAACCAGAG GCGCTGATCC CACCATTGCT GGCGCTGCTG ACAGGCATCC TGCCGTTGCT CGAGTACGGC TATCCAACGA TCATTGGGCG CGGATGGGAT ACCGAGGCTT ACCTGCCGAT GGCGCAGCAC CTGATCGACT ATTCCCTGCC GCGCATTCCG GAAGCGCCAC AGAGTCTCCT GCGCGACCTT GTGACGCATC CACCGCGAAT CGGGCTAACC CTTGGCTTTT CGATATTTCA TGGAATGACG ATGATCTTCA GCGGCGCCAG CGCACTGGCA TCATTCGCGC CTGTCATTGC GTTTATGCGT GCGCTGGCTG TACTGGCGAT GTATGTCTGG CTGCGCGCAA CGATGGACGC AGGGCGGGTC GGATCATTTC TGGGAGCAAC GCTCACCGCG CTGACCTCGC TGATGCTCTG GATCGGCTAT TTCAACTTCG GGATGCAGAT GTCCGCCTGG TGTCTACTGG CGCTGGCGCT CACCACCGGG CTGGCAGCCG TTGATGATCT GGCACAGCGT CGTCTGGCGG CATGGCGTGG AGCGCTGCTG GCGGCAATCG CTCTGGCTGC CATACCCATC GCCTACTATC CGGCGCTGGT TATCGCCGTT CCGCTCATTT CAGCCGCAGG AGCGGCGCGT CTGTTCGAAA CATGGCGTCA TCCGCAGCCG TACACGACGC CGATCTCGCT GGCGCTGGCG GCGCTGGCAC TTGCCGGATT GACCCTGGCT GCGTCTGCGC TGGCGGTTCA GGATTACTTC GAGGGGTTCA GTTTTCGCTA CTCGCTGATC GAGCCAAAAA TCGGACCGGA TCGGTTTATC GGCGTCGATG AAATCCTGGG ATTGACCGCA TTTCGCCTGT CTAACGACGG TGATCAACCA CCTTCGTTGC TGATCGGAGT TGCGCTGCTG GCGACGGTGT TGCCGGGATG TGCAGCGTTG GTTCTGCCGC ACCGGTGGCG CAATGACGCT GATGCCGGTG AACGCACGCG ATTACGCTGG ACACTGACCA TTGCTGCCGT CGCAGCAGCG TTGATCTGGC TACGCTTTGG CAGACCGTAT GAATATGGCT TTATGAAAGG CGCTGCGTAC ACTTCGTTCG TCATCTGGGG GCTGACTGCG TCAGGGGTAG AACGAATCGC CCAATGGACA AAGCGCACCG GGATGCTGCT GGCGTCCAGC GCTGCTCTGC TGATCCTTGC CTGCACCGGT TGGTCGCAAT CGCTGACGGT CGCCGATCAT ATACGCGGAC CGGCAATCTT CACCCGTGAT ATTGCTGCAT TCGACCGGGT AGCGGCGCAA CTGCCGCATG GCGCGACCGT GTTGTTGAGC GGCGACGAGA CCCTGACCGG ACCGATCAAT GGTATGCTGG CGACAATGCT GTATGGCAAG GAACTCTGGG GACGGGTTCC CGCCGCGTAT GCTGCGCAAT CGTTCTGGTC TCCTGGCGAA ACGCCGAACT ATGTCGTGCT GGCAGCGCGC GAGGACCCCT GGCCCCTGGA CGTTGGCGCG AAGGAGCGCT GGCGGAGTAG CGCGATTGCT CTCTACGAAA TGCCGCCGGA TGCCACCTTT GTTCTGGGAC GCAGCGAGAG TTATGTCATT GCAGCAGTCG ATCCAAAATC GCCCGCATCG CTGGCAATCT GGCGACGTGC CGGGCACAAT CGCGTCATTG CGCCCAACGA ACCCTTTACT CTGGAGATGC CGCACGCAGC GACGTTGCGC CTGACGCTGG CAGCGCTGGA AGCGCAGACG GTAATGTTGC GTCAGGGGCA TACCACCACA ACGCTCTCGC TAGAGGCAGG GGTTACAACG ATCAAAACAG GGAGCAGTTC GACTGTACAG GTCATCCCCA CAGCGCCGCT GGCGCTGGTG CATGCTGTTG TGTCCCCAAC CGATACGCCG ACGCCGGTCT CGACATCGCT CGACATAACG CGCGTGGCAT GGAGCGCAAC GAGCGAACAA CAGGGCGATC AGATCGTTCT ATCGACAAGT CTGGCAAATC CAGGCAATCA CGCCTTGCGT TACGAAGTGA TTATTATCGG CGATACGTTC GATGCGCCGG TGCGCATCGC GCGGTTGCTG GCTGCTGCGC CTTTGGAAGG TGAATGGCGG TTGGCGCTCG ATCTGGCACG CGGCGCTTCC GAAGCGCGAA TGAATGGCGC TCCTGCACCG ATGCTGGCAG CCGATGTCGC CGTAAATCCT CCCGACGGTC GCTACTTTGG CGTTCTGGCG ATCTATAGCG GCGGTGCGGT CGTTGCGCAG GCGCCGCTTT TTACCATGAC CATGAGCGAG GGCGCCGTGG CGACCTTCGA GCCGGTCTTC TTCTCGGTCG AAACTGCCCG CGCCCGATCT GACGCCTCGC CGCTCCCCGC GCATCAGCGC GCACTTCTCG CCGGAACGCC GCTGATGTGT GACGAGTTGC GCCTGGCGCT GGAACAGATT GTTCTGGAGC GCCAATCACC CCCGCCTGGC GTGACTCCTG TGACGCCACT CTCCCCCGGT GAACGTCTGA ACGTTCAGGT CTTCTGGCGT GCAACCGGCG ACCGTGAGAA CCAGGATCGG TCACCAATGG TATCGTTCCA GGTGCTGGAT GATGAAAACC GCAAATGGGC GCAGTGGGAC GGCGTACTCG GCGATTGGCT TCCTGTACCT GCCTGGAAGC CCGGTGCAGC AGTGCGGCAG GACATCCCGT TGACGCTCGA TGCCGCCACG CCGCCTGGCG ATTACCGCCT GTTGCTCATT GTGTACGACC CATCAACCGG TCGTCCCATT CTGGTTGCCG GACAGGAAGC CGCAGTTGTC GGGAAGGTGA GGGTTGCGGC AAGCGGGGGG ATAGATCCTT GA
|
Protein sequence | MSFIAELSLI LIAAAVLCCY VGWGAARLAL PPSLASFRAP LTPLIGYVVL LWSGFMLASL VLNLRWTVAV ILIGATVLNI LTWRAEGPPQ PLAWLRAQPE ALIPPLLALL TGILPLLEYG YPTIIGRGWD TEAYLPMAQH LIDYSLPRIP EAPQSLLRDL VTHPPRIGLT LGFSIFHGMT MIFSGASALA SFAPVIAFMR ALAVLAMYVW LRATMDAGRV GSFLGATLTA LTSLMLWIGY FNFGMQMSAW CLLALALTTG LAAVDDLAQR RLAAWRGALL AAIALAAIPI AYYPALVIAV PLISAAGAAR LFETWRHPQP YTTPISLALA ALALAGLTLA ASALAVQDYF EGFSFRYSLI EPKIGPDRFI GVDEILGLTA FRLSNDGDQP PSLLIGVALL ATVLPGCAAL VLPHRWRNDA DAGERTRLRW TLTIAAVAAA LIWLRFGRPY EYGFMKGAAY TSFVIWGLTA SGVERIAQWT KRTGMLLASS AALLILACTG WSQSLTVADH IRGPAIFTRD IAAFDRVAAQ LPHGATVLLS GDETLTGPIN GMLATMLYGK ELWGRVPAAY AAQSFWSPGE TPNYVVLAAR EDPWPLDVGA KERWRSSAIA LYEMPPDATF VLGRSESYVI AAVDPKSPAS LAIWRRAGHN RVIAPNEPFT LEMPHAATLR LTLAALEAQT VMLRQGHTTT TLSLEAGVTT IKTGSSSTVQ VIPTAPLALV HAVVSPTDTP TPVSTSLDIT RVAWSATSEQ QGDQIVLSTS LANPGNHALR YEVIIIGDTF DAPVRIARLL AAAPLEGEWR LALDLARGAS EARMNGAPAP MLAADVAVNP PDGRYFGVLA IYSGGAVVAQ APLFTMTMSE GAVATFEPVF FSVETARARS DASPLPAHQR ALLAGTPLMC DELRLALEQI VLERQSPPPG VTPVTPLSPG ERLNVQVFWR ATGDRENQDR SPMVSFQVLD DENRKWAQWD GVLGDWLPVP AWKPGAAVRQ DIPLTLDAAT PPGDYRLLLI VYDPSTGRPI LVAGQEAAVV GKVRVAASGG IDP
|
| |