Gene Rcas_4062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4062 
Symbol 
ID5541573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5271338 
End bp5272999 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content65% 
IMG OID640896174 
Producthypothetical protein 
Protein accessionYP_001434112 
Protein GI156743983 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0240971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.050702 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTCT CGCTGGACAT TGAACTGTCG CTGGCTGAAG ATAGCGCTCT GCCGGTCGTG 
ACCGAGCAGC AGCCGGTCGA ATTCGTCTGC CTTCCGCCGG GGGAAGCGGC GCTTGTGTTG
ACCGTCGATG GGATGGCGCT GACGCCATTC CTCCGCCCTG GTGATGCGCG CTGGCGCTGG
GTATGGAATC CTGGCGCCAG CGTAGGCACG CACACGCTGC GTCTGACGGT ACACATCAAC
GGCGAGGCGT TTGGCGAATG GCAATGGCGA CTATCGGTAG CACCACGCAA AATCGACAGC
GAACGATATG TGGCGATCCT TGAGGATGTC GAGCGCGTCG CTCCTCCACT GGTGCGATCC
CTGGCAGGCG CTGCCCGCGA TGCGCGCCTG GCGGGGCGCA GTGACCTGCC GCGTCTGTGG
TTCGACGACG CAACGGTATT GTTTGGTTCG TCATTCGAGC CGTTCGAGCA GACGGTCCGG
CGCATTCTGG CGCATCCCCG CAGTCTTCTG CGGCGTGAGG AAGAGCATGT GCCGCTTGGA
CAGGCGCGCG AACTATCGGT GGCAGCAGTG CAACGCGCGG TGCAGGGCGA CGTGGAAGCG
GCGCCGTCGG ATGCGGCGCC GCAGGTGCAG CGGCTGCTGC GACCGGAAGG CGGCGCGTTG
CCGCGCACCA TTGTGCAGGA TCTCAGCCGC GACACGCATG ACACGGCGGA ACATCGTCTG
CTGAAACGTA CCCTGGAACT TCTACGCGGG CGCGCGCGGC GGTGCGCAGA TCGGGCGCAG
CGCGAAGTCG CCCGTCTCGA TGCCGCTGCA CCGGCTTCCT CCCGCGCCGC GCGCGCCCGC
GCCATTGCCG CGCGCTGCGA TCAGTGCGCC CAACGGATCG CCGGTCTGCT GGGTGCGCCA
TTCTTCGAGC AAGTGACGGC GCTGCGCCAT GCCCCGGCGG TCACGCCGTT GATCCGGCGT
GATCCCGCGT ACCGTCAGGT GTATCGGATG TGGCGGGCGC TGCATCAGGG GGTGGTTGTC
GATCCCGGTG CGCCGTTCGA TCTGCCGGTT GTTGATCTTC CACTCCTGTA CGAACGCTGG
TGTGTGTTGC AGGTCGTTCA GGCGCTGCTG AACCTGGGCA TCGAGGCGCA TTCCTGCTCG
CTGCTGCTTC CTCCTTCGGA CGATGCCGAT GCCTGGTCCT TCAATTTTCG GCGTGATGAG
CCACTGTTGG TTGCCACGTG GAGTGGTTGG ACGCTGCGAC TGCGCTACCA GCCGTGGTAT
CGTCCGGCGC CGGGTAGTCG CAATGATGAA ACGCTCGTCT CACTCGATTG CCACACGCGC
ATCCCCGACA TTGCCATCGA AATGGTTCGG CCCGATCATC CGCCAGGAGT CATTGTGCTG
GACGCCAAAT ATCGCCTCGA TGCCGATGGG CGCGGCGTTC CCGCTGACGC TCTGGCAGAG
GCGTATGCGT ATGCCGGCGC CATCGGCGTT GCCGGCGCTC CCGCTGTTGC TGCCGCGTTC
ATCCTCTATC CCGGCACAGG CGTTGCTGAA CGCTACCCCG GCGGCGCCGG CGCCATCCCG
CTGCTCCCCG GCGCCGTCGG GACGCTGGAG GGGGTATTGG CGAGGGAGAT GATTCTCAAG
TGTGGCACTT TTCAGGAACA AAGGGTGTTT GCCACGTCCT GA
 
Protein sequence
MTFSLDIELS LAEDSALPVV TEQQPVEFVC LPPGEAALVL TVDGMALTPF LRPGDARWRW 
VWNPGASVGT HTLRLTVHIN GEAFGEWQWR LSVAPRKIDS ERYVAILEDV ERVAPPLVRS
LAGAARDARL AGRSDLPRLW FDDATVLFGS SFEPFEQTVR RILAHPRSLL RREEEHVPLG
QARELSVAAV QRAVQGDVEA APSDAAPQVQ RLLRPEGGAL PRTIVQDLSR DTHDTAEHRL
LKRTLELLRG RARRCADRAQ REVARLDAAA PASSRAARAR AIAARCDQCA QRIAGLLGAP
FFEQVTALRH APAVTPLIRR DPAYRQVYRM WRALHQGVVV DPGAPFDLPV VDLPLLYERW
CVLQVVQALL NLGIEAHSCS LLLPPSDDAD AWSFNFRRDE PLLVATWSGW TLRLRYQPWY
RPAPGSRNDE TLVSLDCHTR IPDIAIEMVR PDHPPGVIVL DAKYRLDADG RGVPADALAE
AYAYAGAIGV AGAPAVAAAF ILYPGTGVAE RYPGGAGAIP LLPGAVGTLE GVLAREMILK
CGTFQEQRVF ATS