Gene Rcas_1501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1501 
Symbol 
ID5538976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1916983 
End bp1918545 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content61% 
IMG OID640893639 
Producthypothetical protein 
Protein accessionYP_001431613 
Protein GI156741484 
COG category[S] Function unknown 
COG ID[COG1543] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.53215 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.500126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCACG ATGGACAGCA CGAGGAAGGA AACCGGTCGT TTCCCGGCGC TCTGGCGCTC 
ATACTCACGG GTCATATGCC ATACCTGCGC GCTGCCGGTC GTCGTCCCGA CGGCGAAGAC
CCGTTGCATG AGATGATCGC CTGTTCGATC ATTCCCACGT TGAATGTCCT GTATGATCTG
CGCGAACGTG GCATGCGCCC ATACGTGTCG CTGGCGTATT CGCCAGTGCT GCTCGAACAA
CTGGCGGACA TCGTCGTACA GAAACATTTT GTTATCTGGA TGGAACGCTG GCTGGCGCGC
TGCGAAGCGG CGCTGCGGCG CTGGCAGCGG CAGCGGCAGC GGCATCAGGC GTATCTGGCA
CGTTTTTATC TCGACTGGGG TCAGGGGATT CTTCACAGTT TTACCACGCG CTACGGGCGC
AATCTGGTCG CGGCGCTGCG CGAGTTGTGT GCGACCGGAA CGGTCGAGCC ACTGGGAGGC
GCCGCAACGC ATGCATATTT GCCACTGCTG TCGCGACAGG AGTCGGTGCG CGCGCAGCTC
GATATCGGAA CTCTGACGGT GACCCGTCTG CTCGGTCGCC GTCCGCGCGG CGTCTGGCTG
CCTGAGTGCG GTTTTCGTCC GGGATTGGAG CAGGTGCTGC GGTTGAATGG TACGCGCTAT
TTCATTATCG ATCCGGCAAG CGTCGCCGTC GATGCCTGTG TCACCCACCT GCGTCCGCGA
TGGGTGATGC CGCGTCGTCT GATCGCACTG CTGCGCGCGG TTGATGCGTC GCTTCAGATC
GTCTCCCCCG CCATTGGGTA TGTCGGTGAT CCGTTGTATC TGGCGCCCCG TCGTGATCGG
AGCACGCATC TATCCATCTG GCGCAACGGA GACAGTGATA CCGTCATCGA GCCGTATGAT
CCGTATCACG CCTTTCGACG CGCTCAGGAA CACGCCATCC ATTTTGCTGA ATGGGCAGCA
GCCGAATTAC GCGCTTTCGC CAATCGTCAT GATCGTCCAG GGATGTTGGT AGTTCCGCTC
GATGCGGAGG TACTGGGTCG GCGCTGGTTC GAGGGTGTCG CCTGGCTGCG GACGCTCCTG
GAAACTGTTC TGATCCATCG ACCGTTTGCC CTGACAACGC CTTCGCCGTA TCTGCGCGCC
TTCCGACCGC GCCAGAGCAT CGTCCTGCGG GATGGATCAT GGGGTCCTGG CGGTGATCAT
TCGGCATGGA ATGCGCCGGC AGGCGCTCTG CTCCGTCGTG CCCTTGATGA AACGGAGGAT
CTCGTCGTTG GGGTGGTGCG GCGCTTCCCC GATGCCCGCG GCGATAGAGA ACGCGCACTC
AACCAGGCAG TGCGCGAATT GTTGTTGGCG CAGGCGAGCG ATTGGTTGTT GCTCCTCGGT
CGGAATGATG CCAGTGAGTC GCATCGTCCG TGGGTTCATC TGGCGCGCTG CCGGCAGTTG
TGCGCGCTGG CGGAGCGCGC CTCGCTCGAT GAGGACGATC AGCAGACACT TGCCGCTATT
GAAGAGATTG ACAATCCCTT CCCTCATCTC AATTATCGTA TTTTGACGGC AGAGACGGTG
TGA
 
Protein sequence
MSHDGQHEEG NRSFPGALAL ILTGHMPYLR AAGRRPDGED PLHEMIACSI IPTLNVLYDL 
RERGMRPYVS LAYSPVLLEQ LADIVVQKHF VIWMERWLAR CEAALRRWQR QRQRHQAYLA
RFYLDWGQGI LHSFTTRYGR NLVAALRELC ATGTVEPLGG AATHAYLPLL SRQESVRAQL
DIGTLTVTRL LGRRPRGVWL PECGFRPGLE QVLRLNGTRY FIIDPASVAV DACVTHLRPR
WVMPRRLIAL LRAVDASLQI VSPAIGYVGD PLYLAPRRDR STHLSIWRNG DSDTVIEPYD
PYHAFRRAQE HAIHFAEWAA AELRAFANRH DRPGMLVVPL DAEVLGRRWF EGVAWLRTLL
ETVLIHRPFA LTTPSPYLRA FRPRQSIVLR DGSWGPGGDH SAWNAPAGAL LRRALDETED
LVVGVVRRFP DARGDRERAL NQAVRELLLA QASDWLLLLG RNDASESHRP WVHLARCRQL
CALAERASLD EDDQQTLAAI EEIDNPFPHL NYRILTAETV