Gene RoseRS_1597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1597 
Symbol 
ID5208552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1949051 
End bp1950859 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content59% 
IMG OID640595203 
ProductAllergen V5/Tpx-1 family protein 
Protein accessionYP_001275939 
Protein GI148655734 
COG category[S] Function unknown 
COG ID[COG2340] Uncharacterized protein with SCP/PR1 domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGAT TACTGCACCT GTTCCTTCTG GCTGGTCTTC TGATCGCTCT GGCGCATTAC 
GTGCCGTATT CACATCCGTC GCCATCTCGT GCACTGGCGC ACGCCGAACC CGCATATCAG
ACCGCCGATC TGCTCTACAA TGAAGCGCGT ACAGTCCACC TGGGAAATCT CGCCAGACGC
GCCAATGGCA TTCCGCCGTT GCGCTGGAAC CGCCAGTTGA CCGCTGCAGC GCGCTGGTTC
GCGTGGGACT CGACCGAGAA TCGTCCCATC GGGTATTGTG GCCACCAGGA CACGAACGGA
AACTGGCCCG TCTACCGGGC GCGCGCCTTC GGGTACCTGG GATTCGCAGG CGCAGAGAAT
GCGTTCTGTG GGTACGTTCC CCCCGAAGGC GCCATTGAAG GGTGGATGAA CAGTCCCGGG
CATCGCGCCA ATTTGCTCGA CCCCGCCTCG CGCGAAGTGG GACTTGGATA CTACCAACGT
TCCGGTGATG GGCGCGGTTA TATCGCGCAA AAATTTGGCG TCGATTCCGC CTATGCACCC
GTCATTATCG AAAACGAAGC GCCTTTCACT TCCAACCCCA GCGTCAACCT GTATATCTAC
AACCGGACGG AGCAGAGCGG TTTTGCGAGC ATGGGTCCGG CAACGCAGAT GATGGTCAGC
AACGATGCCT GTTTTAGCGA CCGTGCGTGG GAGCCGTTTG CCAGCCACAA GACGTGGACG
CTGGAAGGCG GTCAGGGCTG GCGCACCGTG TATGTCAAAA CGCGCGATGC ACTCAATCGC
ACCGCAACGG TGAGCGATAC GATTTACCTG GGAGAACCGC TGCCAGTGCA GGAACTCAGC
GACGCGCATC TCAGCACGAC GCAGCCATAT GTGACCCTGT ACCATCTCGA TGGCGGCGGC
CTGCCAATGG TGCAGTTCAG TCCGGGGTGG ATCGCCGATG ATACCTATCC GACATTTGGC
AGGTTGTGGG GAGCCGGTGA ACGTATCAGT GATCCCGACG CCTGGGGCGG CACAGCGTAT
CGACTCTTGC CGGGTAGCGG CGAGACATCG GCATGGGTGT GGGACACCAC ATTCATCAAG
GACACGCCGA TGACAGCATA CATACGGTTG AAGACGGGCA GTAACGTCTC GAATCAAGAA
ATAGCGCAGG TGACGGTCAA GGGCGGCGGT GTCGAGTATG GTCCGCGCCG ACTGCGCGGC
GTCGATTTTG CCGCGCCCAA CCAGTATCAG GAGTTCGCAA TCGATTTTAC GTTTCATTCG
AACCCGAACG AGCCATTTCT CATCTTCCAG TTCTGGCGTA GCGGCACTAC CGATATTTAC
GTAGACGCAG TGACCATCTT CAGCAGACCC CAACCGATAA CCAGTCCATT GACGTGGAAC
GTGCCGGGCG GTAATTATCG CGGTCAGGGC GTTTGGGTGC GCTACACGGA CGGTGTGCGC
TTTTCGGGGA TGGTCGAAGC TGCGACAACA CCAATGCAAC TCTTCGTATC CCCGACAGAA
CTGACATTCC TGGCAGCCAG GGATGGAACC CCTCCGCCTG CTGCATTTTT GCGGGTGGCG
CCGGTATGTG CGGCGTTTAC CTGGGAGATC AGCCATGATG CGCCCTGGTT GAATGCAGAG
AAAACAGGGA GCGGCGCCAG AATCAGCGTG AACCCGGCTG GACTGAGCAA CGGCATCTAT
TCAGGCAATG TCACGGTACG GGCAACCGGG GGCGCCAGTA CAGCGTCAGT TTCCGTTCCC
GTCAGATTGA TCGTCGTGGA TCGTTTGTTC CCGGTCTATC TTCCGCTGAC AGCCAGGGGC
TACTGGTGA
 
Protein sequence
MRRLLHLFLL AGLLIALAHY VPYSHPSPSR ALAHAEPAYQ TADLLYNEAR TVHLGNLARR 
ANGIPPLRWN RQLTAAARWF AWDSTENRPI GYCGHQDTNG NWPVYRARAF GYLGFAGAEN
AFCGYVPPEG AIEGWMNSPG HRANLLDPAS REVGLGYYQR SGDGRGYIAQ KFGVDSAYAP
VIIENEAPFT SNPSVNLYIY NRTEQSGFAS MGPATQMMVS NDACFSDRAW EPFASHKTWT
LEGGQGWRTV YVKTRDALNR TATVSDTIYL GEPLPVQELS DAHLSTTQPY VTLYHLDGGG
LPMVQFSPGW IADDTYPTFG RLWGAGERIS DPDAWGGTAY RLLPGSGETS AWVWDTTFIK
DTPMTAYIRL KTGSNVSNQE IAQVTVKGGG VEYGPRRLRG VDFAAPNQYQ EFAIDFTFHS
NPNEPFLIFQ FWRSGTTDIY VDAVTIFSRP QPITSPLTWN VPGGNYRGQG VWVRYTDGVR
FSGMVEAATT PMQLFVSPTE LTFLAARDGT PPPAAFLRVA PVCAAFTWEI SHDAPWLNAE
KTGSGARISV NPAGLSNGIY SGNVTVRATG GASTASVSVP VRLIVVDRLF PVYLPLTARG
YW