Gene Rcas_4066 details


Gene Information

Locus tag: Rcas_4066
Symbol:
ID: 5541577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5277082
End bp: 5279229
Gene Length: 2148 bp
Protein Length: 715 aa
Translation table: 11
GC content: 61%
IMG OID: 640896178
Product: hypothetical protein
Protein accession: YP_001434116
Protein GI: 156743987
COG category:
COG ID:
TIGRFAM ID:
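
The coordinates and lengths listed above are consistent with each other, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    return gene_len_bp // 3 - 1


length = gene_length_bp(5277082, 5279229)
print(length)                     # 2148
print(protein_length_aa(length))  # 715
```

Both results match the Gene Length and Protein Length fields of the record.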


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.174208
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCCA TTACGCAACA CACACCAAGA GTTGCTGTAT TCCCACACCT AATTGCCCTG 
GCGCTCTATA CCCTGCTTGG CATTCTCCTG ACGTGGCCCT TACTGCTAAA CCTGACAAAT
GGCGTCATCG GCGCTGTTGA CGGCGTCGAT GCTTATCAAA ACGCCTGGAG TCTATGGTGG
ACGGCGCAGG CGCTGACCTC GCTGCGCAAC CCTTTCTTTT CGCCGCTCCT CTTCTACCCC
GATGGCGTCG ACCTCTTTTG GCAGCCGCTT GGGTTCAGCC AGGGAGTGCT GGCGCTGCCG
GTGACGCTTA CGCTGGGACC GGTCGCTGCG GTCAACTGGA TTGTGCTCAC CAGTTTTACC
GTTGGCGGAT ATGCAACCTT TCTGCTAGCG CGGCGCGTGA CCGGCAATGC AGCGGCAGCG
CTGGTGGCGG GCGCCACCTT TATCTGTTCG CCCTATCACA TGGAAAAGGT GATTGATGGC
AACCTGGAAG TGGCTGCCAT TCACTGGCTT CCGTGCTATG CCTATGCCCT CTTGGCGCTG
CTCGACCGTC CATCATGGCG TCGGGCGCTG GCGGCTGGCG CTCTGCTCGT CTGGGTCAGC
CTGGGCAGCT GGTATTACGG GCTATTCGCC GTGCTCTTCA CGGCGTGCGC TGCCGGTATC
TGGGCATACG GCGCCACCCG CAATGCAGAG CGCATGCATC AGTTGCAGCG CGGCTTACAG
CAGGCGATGT GGGGAGTAGC CCCCTTAGTC ATTTTGGTGA TGGCGATTGC ACCGGCGTTG
TATAGTCTGG TGACAACCGG AGCGGACGAG ATGCTGTGGG ATATGCGCTC GATACAGCGT
GAGCGCTCTG CCGATTTAAT AGATGCATTT CTGCCCAATC CGGTCCATCC TGTGTGGGGT
CCGGCGGTGC GCGCCTGGCG TAACCAGATC TATCCCAACG CGGTGATCTG GAATGTGTCG
CTGGGGTGGA TCGCGCTCGG ACTCGGACTG TTGGGCGCTA CTACTGCGTG GCGTGCCACG
TGGCGCTGGT CGCTGCTGGC GCTGGCATGC TTCATTGTCG CCCTTGGACC GGAGTTGAAG
ATCGCAGGCT GGCACACCGG TCTTCCACTG CCGTATACTC TCATCCAGGA TATGCCCGTC
ATTCGCTCAG GGCAGCGACC GAACCATATG ATGGTAATGG TCAGTCTGAG CCTCTCGATC
CTTGCGGCAT ACGGCTTTAC CGTGTTGCAA CAACATCTCA TACAACACCC CTCGCCAATC
CATATGTGGA GTATGGCGCT TGCATTGATC GTACCGGTTG CTGGTATTGA CGGATACGCC
GGGACCCACA CCATCGTCGC GCGCCGCATC CATCCATTCT ACGCCACATT GCCGCCTCCC
GACGGAGCAA TCATAGCGTT GCCGCTCTAT CTTAACGTCA ACCGTAGCGA GAACCTGACG
GCACAGATAG GTCATGGATG GCCCATCATC GGCGGGTATG TCGCCCGTCC GCCTGCATAC
GTATTTCCGA AGTATACCCC TGGTGTCCGT GAGATACAGT TTGGTGAAGT CGAAAGACAG
GACGTCGTAT CGCCTGGATG GCCCGAATCT GCCCGGCGAG CGCTGGCGGC GTACCGCATT
CGCTACATCA CCATGGACTT GCAAAGCAAT AAAGACGAGT ATTTTGCGCG CCTCCGCCCG
CTCCTCGCTG AGTTGGGGAT CGAGACGCCA GTCTTCGTTG ATGAGACGCT GGAAGTTTAC
GCTGTACCGC AAGCCTGGCG GGTCGTGCCG GTGGCGTTTC TGGGCGACGG GTGGCAACCG
CTTGAACGCG AACCGGCAAC CGGCGTTCGC TGGCGCTGGA TGGGCGAGCG CGCCGAAGTG
CGGCTGTTCA ATCCCCTCGT CGGCGCGGCG TTGGTGCGCC TGACTTTCTG GATGGAGGCG
TACTATGAGA CGCGACCACT CTGGTACACG CTCAACAATA TGGCGCTGGG AACAGTCACC
GTTCCTTCCG GGCGCGCGCC AGCGCGCGCA ATCTACGTGC TGCTTCCGCC CGGCGACCAT
GTGCTGACCT TGCAGGCGCC CGCTGATCCT GACCCGGCGC GCGCTGGCGC GCCGATCAGC
ATCCGTCTGT TTGCGCTCGA TGTCCGCAGC GCTGCCGGCG CGCCATAG
 
Protein sequence
MASITQHTPR VAVFPHLIAL ALYTLLGILL TWPLLLNLTN GVIGAVDGVD AYQNAWSLWW 
TAQALTSLRN PFFSPLLFYP DGVDLFWQPL GFSQGVLALP VTLTLGPVAA VNWIVLTSFT
VGGYATFLLA RRVTGNAAAA LVAGATFICS PYHMEKVIDG NLEVAAIHWL PCYAYALLAL
LDRPSWRRAL AAGALLVWVS LGSWYYGLFA VLFTACAAGI WAYGATRNAE RMHQLQRGLQ
QAMWGVAPLV ILVMAIAPAL YSLVTTGADE MLWDMRSIQR ERSADLIDAF LPNPVHPVWG
PAVRAWRNQI YPNAVIWNVS LGWIALGLGL LGATTAWRAT WRWSLLALAC FIVALGPELK
IAGWHTGLPL PYTLIQDMPV IRSGQRPNHM MVMVSLSLSI LAAYGFTVLQ QHLIQHPSPI
HMWSMALALI VPVAGIDGYA GTHTIVARRI HPFYATLPPP DGAIIALPLY LNVNRSENLT
AQIGHGWPII GGYVARPPAY VFPKYTPGVR EIQFGEVERQ DVVSPGWPES ARRALAAYRI
RYITMDLQSN KDEYFARLRP LLAELGIETP VFVDETLEVY AVPQAWRVVP VAFLGDGWQP
LEREPATGVR WRWMGERAEV RLFNPLVGAA LVRLTFWMEA YYETRPLWYT LNNMALGTVT
VPSGRAPARA IYVLLPPGDH VLTLQAPADP DPARAGAPIS IRLFALDVRS AAGAP
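
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It assumes a plain codon lookup: translation table 11 assigns the same amino acids to codons as the standard code, and its alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied with their original spacing.
print(translate("ATGGCTTCCA TTACGCAACA CACACCAAGA"))  # MASITQHTPR
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.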