Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1000 |
Symbol | |
ID | 5538466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 1307187 |
End bp | 1310051 |
Gene Length | 2865 bp |
Protein Length | 954 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640893143 |
Product | hypothetical protein |
Protein accession | YP_001431126 |
Protein GI | 156740997 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.330907 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATGC AACGTCGATG GCAACCTGCG CTCAGAACGA TCCTGTCGTT CGTCTTGTGC CTGCTGCTGT TCCTGATGTC TGCGCCGGCG CAGGCGCAGA CGCCGCTCCC GCCGCTGGTC TTTGTGGCGC GCAGTCGTCT GGCGACCGGT GATTATCTCT TTCCACGCGA TGTCGGTCCG GCAGGGCATT TGATCACCGG CATGACGAAA TTCGCGCCCG GATCGAAATT ACTCATCCGT GATCAGGCAG GTCAGTTGCG TGTTCTGATC GATACGGCGC GACCGGCGGG TGATCCGCTG AACCCGCTGG GGTTGCGCGA TGTGCAGTCG CCTGATGTGT CGTTCGATGC CCGCCGCATT GTGTTTGCCG GCACGTTTGG ACCGGAAACG TTTCGCAATC AGCCGAACGG TCGCCCGTAT TACTCGTGGC GACTGTTCGA GATCGGCGTC GATGGTCGTG GCTTGCGGCA ACTCACGCAC TCTGATCGCG ACATTACGAT TCCTGAGGGA CCGGCAAATG CGGAGGCGTA TGCGTTCTAT GACGATCTCT TTCCAGCATA CCTGGCGGAC GGGAGGATTG TCTTCAGTTC GTCGCGCTAT CCGGCGCGTT CACCCTACGA TGGTCGGCGG TCATTCAATC TCTATATGAT CGACGGTGAT GGCGGCAATA TGCGTCGCCT GACGACCGAG CGCTCTGCTG CGCTCCATCC GGCGCCGCTT CCCGACGGCA GGATTGTCTT CAGTCGCTGG TGGGTCAATT TCAACCAGCC GAGTGAGCGC GGTATTTACA ATCGCATCGA TAACCGCGCG GGCATGGAGA TTGCCCGTGA CCAGAGCGGA CGCCCGATCA TGGTCGAACG ACGCATTCAG GTTGTCGAGA CGGCACCCCC TGCTCTGGCG GCGTCACAAC CGACCGCGCC GCCGAAACCG ACGCCACTAC CGACCATTTC GTTTGTTGAG AAGATCGATC CTGCAACCGG CAGCATCGTG CGCCTGACCA AGACTCCGGC TCCGCCGACG CCAACGCCGC GCGCGCGACC GACTGCCACG CCAACCGCAG CGCTACCGGG TGGCACACCC CGCACGATTG TCGTGGAACA GCCGATCACC GGGTATCGTC TGCCGGATGG GACGCTGGTC TATTCCAACA CCAACACGAC CTTCAACCCG GCGCGTGGTC GGCTGGCGGA CGGCTTCCCC ATCCGTGATG CGCCGAACAC CTGGCATCTG ATGGCAATCG AAGCCGACGG CAGCGGCATG CGGCGCTTCG CCTGGACGCC GCGCTACGCA TCGGCGCTTA CGAATGATAG CGGCCTCGAT ACGTACAACG CCGTGCAACC GGCGGTTGTG CTGTCCGGCG GCGAACTGCT GATAGCATAC ACGACGCAGC GCGATCAGAC GATGGCGCAT TCGACGCTCT ATACCGGCGT CCGCGTTGCG CGCCCCGGTA TCGAAAACAT GGCGCTGAAC ACCACCGAGT CGATTGCCGG GTATCGCTGG GACGACGGAA CCGCTTTTCG CCCGCCGTAT GCGCTGGCGC CCGCCGGATT GCCTGACGGA CGAATCATCT TCGCACAGAC CGCCGCCATT TCCGCACCGG CGCGAACCGG CACATACACG GACACCCGCA ATGGGCGCAC GATCACGCTG CGGTTACAAT CGTCGTCGTT GCGCTACGAA CTGCGCACCA TCTATCCGAA CGGCGCCCAA AACGAGGTTG TGCCGCTCGC CGGCCTCTCC GATGACTACG ATGCCGTCGA AGCCAGACCA ATCGTGGCGC GTCCGGTCGG CGACGGACCG GGCATGTGGC GTCTGCCGCG CGGGACGCCG CCGCCGGTCA GTGATGATCC GCTGGAAAGC AATGTGCCGT TGGGTCTCCT CGACACGTTT GGCAACCCGG CGTATCCCTG GAGTCGCCGG AGTATCCAGA GCGTCGAACT CGTCGCCGTG CGCAACGCCA ATGTCTATGC CAATCCGCCG CTCGAATTGC CGTTCATTAA CAACTCGCCG CCGCCGGGGA GCGTGGCATT CGCCGACATC TACATCGATG CCAATCAGTT TGGCGGCGCC ACATCGCGCG CTCCCAACCC GGACGATCAG GCGCGCGCCG TCAAATGGTT GACCGTGCCG GTGAACCCGG ATGGGTCGTT CATCGCCTCT GCGCCTGCCG ATGTGCCGAC GTTCATTGTG CTGCGCGACA AAAGCGGGCG GATTGTGCGC GGCGGCAATC GTCACACCCT CAGCATCGCG CAAGGGAACT CCGCCGGCCG TCCCGGACAA CCGATGTTCT GCATTGGTTG TCATATGGGG CACGCAAGCG GTTCCATCGC CAACCCGTCG CTTGCCGAGC GCGGTTGGAC GAACATCGCT CCGGCGGCGT CCATTGCCGC ATCGTCATCC ATCGAAAACG GCGCACCGGC GCGGATCAAT GACCGACGCG GATATGTGAC TGCACCCAAC GGAACGCTCA TTGATCGCAC GCCGCCGTGG ACGGCGAACG GCGGTGCGGG GCAGTGGATT CGGCTGGAGT GGCAACTGCC GATGGCAATC CTCGAAGTGC GTCTCGTCGG CGCGGAACCG GGACAGGAAG GGCGCAGCGC CGATTACGAG GTAAGCGGCG AACTGCGCTT CTACCTGCGC GGGCAGGAAC TTGGCGGAAC CGCCAGAAGC GTTGAAGCGG TCGCGCCGCT CTCGCGCGGT GGGACTCTCA TTCGCCTGCC ACAACCCATC GCGGCTGACC GGGTTGAGTT TACGGTCACC GCCGTGCGCG GTGCGCAACG CGGCGCACCG GCGCCCGCTG CGCTCAGCGA AATCGAGGTC GCCGGTCAGG GCGCCACGCC GGATGCGCTC GGCGTCGGGC GTTAG
|
Protein sequence | MLMQRRWQPA LRTILSFVLC LLLFLMSAPA QAQTPLPPLV FVARSRLATG DYLFPRDVGP AGHLITGMTK FAPGSKLLIR DQAGQLRVLI DTARPAGDPL NPLGLRDVQS PDVSFDARRI VFAGTFGPET FRNQPNGRPY YSWRLFEIGV DGRGLRQLTH SDRDITIPEG PANAEAYAFY DDLFPAYLAD GRIVFSSSRY PARSPYDGRR SFNLYMIDGD GGNMRRLTTE RSAALHPAPL PDGRIVFSRW WVNFNQPSER GIYNRIDNRA GMEIARDQSG RPIMVERRIQ VVETAPPALA ASQPTAPPKP TPLPTISFVE KIDPATGSIV RLTKTPAPPT PTPRARPTAT PTAALPGGTP RTIVVEQPIT GYRLPDGTLV YSNTNTTFNP ARGRLADGFP IRDAPNTWHL MAIEADGSGM RRFAWTPRYA SALTNDSGLD TYNAVQPAVV LSGGELLIAY TTQRDQTMAH STLYTGVRVA RPGIENMALN TTESIAGYRW DDGTAFRPPY ALAPAGLPDG RIIFAQTAAI SAPARTGTYT DTRNGRTITL RLQSSSLRYE LRTIYPNGAQ NEVVPLAGLS DDYDAVEARP IVARPVGDGP GMWRLPRGTP PPVSDDPLES NVPLGLLDTF GNPAYPWSRR SIQSVELVAV RNANVYANPP LELPFINNSP PPGSVAFADI YIDANQFGGA TSRAPNPDDQ ARAVKWLTVP VNPDGSFIAS APADVPTFIV LRDKSGRIVR GGNRHTLSIA QGNSAGRPGQ PMFCIGCHMG HASGSIANPS LAERGWTNIA PAASIAASSS IENGAPARIN DRRGYVTAPN GTLIDRTPPW TANGGAGQWI RLEWQLPMAI LEVRLVGAEP GQEGRSADYE VSGELRFYLR GQELGGTARS VEAVAPLSRG GTLIRLPQPI AADRVEFTVT AVRGAQRGAP APAALSEIEV AGQGATPDAL GVGR
|
| |