Gene RoseRS_2714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2714 
Symbol 
ID5209683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3371823 
End bp3374885 
Gene Length3063 bp 
Protein Length1020 aa 
Translation table11 
GC content60% 
IMG OID640596315 
ProductBeta-galactosidase 
Protein accessionYP_001277037 
Protein GI148656832 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.204643 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAT TAACGGCGCT CAATACGCTG CCGCCGCATG CTCTGACCAT TCCTTTTCCA 
GCCCACGGTC CGATTGATGC CGATCCACTG GCGTCGCCGT GGCGCCAGAG TCTGAACGGC
GTCTGGGAGT TCCTGCTCTT GCCGCGCCCC GACCATGTGA CGGGTGCAGC GCTGGCAGGC
GGCGATTGGA AGCCGATCCA GGTTCCCGGC AACTGGACAA TGCAGGGGTT CGGAACGCCG
CACTATACCA ACGTGCAGAT GCCCTTTCCC CAGATGCCGC CATCGGTTCC CGACGATAAT
CCGACCGGCG TCTATCGCCG TCGGTTTACG CTTGCTCCCG ACTGGCGCGG ACGGCGGATC
GTGCTGCACA TTGCCGGGTG CGAGGGCGCG TGTTATGTGT ATCTCAACGA TCAACCCGTC
GGTTTTCATA AGGACTCACG CACCCCCGCA GAGTACGATG TGACCGGCGT GGTGCGCTTC
GATGCGCCGA ATGAACTGAT TGCCGTTGTG CTGCGCTGGT CCGACGCCAG TTTCATCGAA
GATCAGGATC ACTGGTGGCA ATCGGGCATT CACCGTGATG TGTTTCTCTA TGCCACCGAT
ACCGTCTATC TGGCGGATCT GTCAGTACGC GCGGATGTGA GCGATGATCT GCAGGAGGGC
ATACTTCGGG TGCGCTGCAC CCTTGATGCT ATCGGTGAAG CCGGGGAGCA TACCCGCGTC
GAAGCGCAAC TCTATGATGC GCACGGGACG CAGATGTTTG CTGAGCCGCT CGGCGCCACC
TATACGCAGA CCCATCCACG CTTCGGGGTG CGTCGCTTTG TGCGCCCGGA ACTGCTTCTG
GAGGGACATG TACCGTCGCC ACATCTGTGG TCGGCGGAAA CGCCATACCT GTACACATTT
GTCGTGACCG TGTATGGACC GGCTGGACCA GAGAGAAGTG CGTGCCGGGT TGGTTTCCGC
TCGATAGCCA TTCGTCACCG TCAACTGCTG GTGAATGGTC GGGCGATCAC CATCAAAGGG
GTCAACCGTC ACGATCATTC CGATACGACC GGCAAAGCAG TCAGTCGGGA ATTGATGGAA
CTCGACATTC AACGCATGAA GCAGTTCAAC ATCAACGCTG TGCGTGCGTC GCACTACCCG
AATGACCCAT ACTGGCTCGA TCTATGCGAC CGTTATGGTT TGTATGTGAT CGATGAGGCG
AATATCGAGG CGCACGCATT CTACTTCGAC CTTTGCCGCG ATGCGCGCTA CACGCGGGCG
TTTGTCGAAC GGGTGCGGAA CATGATCGAG CGCGACAAAA ATCATCCCTC GATCATCCTC
TGGTCGCTGG GGAATGAGAG CGGATACGGT CCGAATCACG ATGCTGCCGC CGGTCTTGCG
CGTCGTCTCG ATCCGTCACG ACCGCTGCAC TACGAGGGCG CCATCTCACG CTGGATGGGC
GAGTCGTGGC ATGGTGGACG CACTGTGACC GATGTGATCT GCCCGATGTA TGCCTCCATC
GAGGAGATTG TTGCGTGGGC TGAGCAGGAA ACCGACGATC CACGCCCGTT GATCCTCTGT
GAGTATTCCC ATGCGATGGG AAACAGTAAC GGCAGTCTGG CAGATTACTG GGAAGCGTTC
GAGCGCTATC CAGCGCTACA AGGCGGTTTC ATCTGGGAAT GGGTCGATCA CGGCATCCGT
GCGACCGATG CGCAGGGGCG CGTCTACTGG GCATACGGCG GCGATTTTGG CGATGTCCCC
AACGATGCCA ACTTTGTGTG CGATGGTCTG GTCTGGCCCG ACCGCACACC CCATCCGGCG
TTGTACGAGT ACAAGTATCT GATCCAGCCG GTGCGCGTCG AACTGGTCGA TCCGTCTGGA
ACGATGCTGC GGATCGTCAA TCGCCACGAT TTTGCCAGCA TCGATTGGTT GGACGGGGTG
TGGGAAGTGA TTGCTGACGG CGTGCCGGTG GCATCTGGCA GGTTGCCCGA ACTTCATGCC
GCACCGGGCG AAGCGCAGGT GGTGAAACTG GATCTCGACG CAGCGCATGG AGCGGGCGAA
CGTTTCCTGA CGGTGCGCTT CTACCAGCGT GAAGCGACTC TCTGGGCGCC TCCAGGGCAC
GAGGTTGCCT GGCAGCAACT CCCGCTTCCA ACGGTCGCCG CGATGCCTGA ACCGGTTATT
GCGGGCGAAT CTGTGGTGGT GGAGCAGCGT CCGGATCGTA TCACGCTGCG CGCTGGCGCC
ACGCACGCCG TGTTCGACGT CAGGAGCGGG ACTCTGGCAT CGTTTGGGCG CGATGAGCAA
AACCTGATCG TTCGTGGTCC GTTGCTCAAC GTCTGGCGGG CGGCAACCGA TAATGACGGC
TTGAAACTGC GGGACGAACC GGAGAAGCCG CTGGCGCGCT GGAAGGCGTT GGGTCTGCAC
CGGTTGCACC ATCGCCTGAA CCACATACGA GTGGTTGCCG TTGACAACGG GGCGGCGTCG
GTTGAAATCG AGCACGCCGC CACCGGTCGC GACCGTTGGG GCGATTTTAT CCATATCCAT
CGCTACACCC TGCACGCCGA CGGCGAACTA TCGGTAGAGA ACACCGTCAT CATCGGCAAT
GCCATCAGCG ATCTCCCGCG CGTCGGGGTA TGCATGCTAC TGACGCCTGG TTTGGAACAT
CTCGAATGGT ATGGACGCGG TCCGTGGGAC AACTACAGCG ATCGCAAGGC AAGCGCCTTA
ATGGGGCGCT GGCGTTCGAC CGTGACCGAC CAGTACGTGC CGTACATTAT GCCGCAAGAG
CATGGGCACA AAACTGATGT TCGCTTCCTG CTGCTGACCG ATCAGGACAG GCGTGGGTTG
TTCATCGGCG GACAGCCGAC CTTCGAGTTT TCGGCGCTAC ACCACAGCGA CGATGACCTG
TTTCGCGCCC TGCACACTAT CGACCTGACG CCGCGTGCTG AGGTCTTTCT CAATCTCGAT
GCAGCGCATC GCGGTTTGGG AACCCTGAGT TGCGGACCTG ACACGCTCGA ACAGCACCGT
TTGATGGACT CAGTGTATCG GTTTGGGTAT CGGATGCGGG CAGTGTCGTC GGATGTTGGA
TAG
 
Protein sequence
MPELTALNTL PPHALTIPFP AHGPIDADPL ASPWRQSLNG VWEFLLLPRP DHVTGAALAG 
GDWKPIQVPG NWTMQGFGTP HYTNVQMPFP QMPPSVPDDN PTGVYRRRFT LAPDWRGRRI
VLHIAGCEGA CYVYLNDQPV GFHKDSRTPA EYDVTGVVRF DAPNELIAVV LRWSDASFIE
DQDHWWQSGI HRDVFLYATD TVYLADLSVR ADVSDDLQEG ILRVRCTLDA IGEAGEHTRV
EAQLYDAHGT QMFAEPLGAT YTQTHPRFGV RRFVRPELLL EGHVPSPHLW SAETPYLYTF
VVTVYGPAGP ERSACRVGFR SIAIRHRQLL VNGRAITIKG VNRHDHSDTT GKAVSRELME
LDIQRMKQFN INAVRASHYP NDPYWLDLCD RYGLYVIDEA NIEAHAFYFD LCRDARYTRA
FVERVRNMIE RDKNHPSIIL WSLGNESGYG PNHDAAAGLA RRLDPSRPLH YEGAISRWMG
ESWHGGRTVT DVICPMYASI EEIVAWAEQE TDDPRPLILC EYSHAMGNSN GSLADYWEAF
ERYPALQGGF IWEWVDHGIR ATDAQGRVYW AYGGDFGDVP NDANFVCDGL VWPDRTPHPA
LYEYKYLIQP VRVELVDPSG TMLRIVNRHD FASIDWLDGV WEVIADGVPV ASGRLPELHA
APGEAQVVKL DLDAAHGAGE RFLTVRFYQR EATLWAPPGH EVAWQQLPLP TVAAMPEPVI
AGESVVVEQR PDRITLRAGA THAVFDVRSG TLASFGRDEQ NLIVRGPLLN VWRAATDNDG
LKLRDEPEKP LARWKALGLH RLHHRLNHIR VVAVDNGAAS VEIEHAATGR DRWGDFIHIH
RYTLHADGEL SVENTVIIGN AISDLPRVGV CMLLTPGLEH LEWYGRGPWD NYSDRKASAL
MGRWRSTVTD QYVPYIMPQE HGHKTDVRFL LLTDQDRRGL FIGGQPTFEF SALHHSDDDL
FRALHTIDLT PRAEVFLNLD AAHRGLGTLS CGPDTLEQHR LMDSVYRFGY RMRAVSSDVG