Gene Rcas_4140 details

Gene Information

Locus tag: Rcas_4140
Symbol:
ID: 5541651
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5360198
End bp: 5361937
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 59%
IMG OID: 640896251
Product: hypothetical protein
Protein accession: YP_001434189
Protein GI: 156744060
COG category:
COG ID:
TIGRFAM ID:
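
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a single stop codon.

```python
# Values copied from the record above.
START_BP = 5360198
END_BP = 5361937
GENE_LENGTH_BP = 1740
PROTEIN_LENGTH_AA = 579


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```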


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.232862
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00288889
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCGGCA TAGCAGGAAT GCGCGCGTGG GCGCAGACGG CGGCACGAAT GCAGTCCGGC 
GACTGGGTTG TTCTGGCGCT CCTTGTCTGT GGAGCGCTGG CGATGATGTA TCCGGTGCTT
GCCGCGCCAT CATCTCGTAT CATAGGGTGG CCTGGCGACA ATATTCAGTA TGTCTATGCT
GCAGGATGGA TGGCTGAGGC GTTGCGTTCC GGTGCGTCGC CGTTTGTCGA TCCGCGCATC
AATGCGCCTC ATGGTCTGGC GTTGACTGCC ACCGATGTGC CTTATGTCGG ATACATTGCA
GTTGCGCCGC TGACCTGGCT GTTCGGTCCG GTGTTTGGGT ACAACGCGCA ACTTGCACTG
GCGCATCTCC TCTCAGGAGT GTGCGCGTAT CTTTGGGTCC GTCATCTTAC CGGCAGTCGG
ATTGGAGGAC TGACGGCAGG ACTGGCGTTT ATGCTGGCGC CGTTTCGTCT CGCGCATAGC
TACGGTCATC CGCAGATTGT CAGCACCTAT CCGTTGCCAC TGTTTTTCTG GGCGCTGGAT
TCGTCGCTGC GATCACAACC GGATCGCAAG ACGCTTGCGG GTCTGGTCGG TGCGACATTT
CTGCTCGGTG CGGCATCGCA GTACTATCTG GTGATCGGTC TGATCTGCGG GATGGTTTAT
GCGCTGCTGA CGCTGGCGAC ACGCCGGGTG AGTCTGTTAT CCAGGGTCTG GCTTGCGGTT
CCTGCTGTCT TTGTCGGAGC ATTGCTGGCG GCTGCCCCTT ATCTGATGAC GGCGCGCGAT
GGCATTTATA CACCTTACCA TCTCGACGTT GCTCGTATGT GGTCGGCAAG CCCGATGAAC
TTTGTGGCGC CCTCCCATCT TCACCCACTC TGGGGGACCT ATGTCGAGCG GTTGCGCCCT
GAGACGCTGT GGGGCGAAAA AACACTGTAT GTCGGCATTG TTCCCGGAAT ACTGGCTCTG
GCGGCGCTTC GCGCTTTTGA TCGCCGGTGG GTCTGGATTG GCACCGCGCT CGTTGCTGCC
GTTCTGTCGC TCGGCACCGA TCTCCACATC GGGAATGTTC CCCTGCATCG CGATCATCCG
GTCTGGCTGC CGGCATATTA TCTGCACCAG TTGCCGGGTA TAAATCTTAT GCGCGTATGG
GCGCGTTTCG GGATCGTGAC GATCCTGTTT GTTGCGTTGC TGGCGGGCAT CGGCGCTGCG
CGACTGGTTC ATCGAAAGAG TGTAGCGGGG CGTCTCACGA ATGGTTTCGG CGGCGCTGCG
CGCCTGAGAA TATCGTCTGC TGCATTGCTT TCCGGTGCGA TTGTCGCGTT GATAGTAGTG
GACTTGATGC CGGGGAGAAT GAACGAGTAC ACAACGCTGG CGCCACGCCC GATTGATCAC
TGGCTTGCCC GGCAGCCCGG CGATTTTACA GTTGGATTCG TCCCGGTTAT TGATGCGACG
ACCAACTACT TCATTTTGTT CGGCACGCTC ACGCATGGCA AGCGAACGAT CGCCTTTATG
CACCAGGCGC ATCTTCCGCC AATCTTCCAG GATTTCAACG AACGTTCGCG GGGATTCCCC
GACAGCGCCT CGGCGCAGCG ACTGCGCGAA CTTGGGATAC GCTATTTGCT GCTCGAAAAA
CCCATGTTCG ACGGCGCGCG CGCTTTCCGC TGGAGCGTCG TTGAGCAGCG GTTGGCAGAA
ACGCCGGAAT TGCGCATTGT GCGGGAAGTC GGCGATGTTG TGGTCGTGGA ATTTCGCTAG
 
Protein sequence
MAGIAGMRAW AQTAARMQSG DWVVLALLVC GALAMMYPVL AAPSSRIIGW PGDNIQYVYA 
AGWMAEALRS GASPFVDPRI NAPHGLALTA TDVPYVGYIA VAPLTWLFGP VFGYNAQLAL
AHLLSGVCAY LWVRHLTGSR IGGLTAGLAF MLAPFRLAHS YGHPQIVSTY PLPLFFWALD
SSLRSQPDRK TLAGLVGATF LLGAASQYYL VIGLICGMVY ALLTLATRRV SLLSRVWLAV
PAVFVGALLA AAPYLMTARD GIYTPYHLDV ARMWSASPMN FVAPSHLHPL WGTYVERLRP
ETLWGEKTLY VGIVPGILAL AALRAFDRRW VWIGTALVAA VLSLGTDLHI GNVPLHRDHP
VWLPAYYLHQ LPGINLMRVW ARFGIVTILF VALLAGIGAA RLVHRKSVAG RLTNGFGGAA
RLRISSAALL SGAIVALIVV DLMPGRMNEY TTLAPRPIDH WLARQPGDFT VGFVPVIDAT
TNYFILFGTL THGKRTIAFM HQAHLPPIFQ DFNERSRGFP DSASAQRLRE LGIRYLLLEK
PMFDGARAFR WSVVEQRLAE TPELRIVREV GDVVVVEFR
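
The protein sequence follows from the gene sequence by codon-by-codon translation. Below is a minimal sketch of that step, not the database's own method. The names `CODON_TABLE` and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it assumes the sequence begins with ATG, as it does here, so the alternative start codons of table 11 are not handled.

```python
# Codon table in the conventional T, C, A, G ordering of each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGCCGGCA TAGCAGGAAT GCGCGCGTGG"))  # MAGIAGMRAW
```

Passing the full gene sequence, with its spaces and line breaks, to `translate` should give the protein sequence listed above once the block spacing is removed.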