Gene RoseRS_3722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3722 
Symbol 
ID5210703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4658678 
End bp4660618 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content62% 
IMG OID640597317 
Productalpha amylase, catalytic region 
Protein accessionYP_001278026 
Protein GI148657821 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0146408 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTTCG TCCGCGATAT GTTAAGCCGA CCACGCCCGC CACGCATCAG GCATGATGTT 
CAGTTGCCGC GTCGGGTTGC ATACTACCCG TCACCGGTCG ACTGGCGTGA CGAGGTGATC
TACTTTTTGA TGGTTGATCG CTTCAGCGAT GGACAGGAAG ATACCCGCCC GTTGCTCGAC
CGGCGCTACC TGGCGGCAGC GCGACCGGCG CTGCCCAACG GCGACCCCTG GCGCTGGGAT
CGCTGGGCGT TGTCGGGCGG TGAACGATTT CAGGGCGGCA CGTTGCGTGG AATCATATCG
AAACTCGGCT ATCTGCAGCG GCTCGGCATC ACCACCCTGT GGCTCAGCCC GGTCTGCAAA
CAGCGCGTCC ACCTCGACAC CTATCACGGC TATGCCATTC AGGATTTTCT GGATGTCGAT
CCGCGCTTCG GCACGCGCCA GGACCTGGTC GATCTGGTGA GCGCTGCGCA TGAGCGTGGC
ATGCGGGTAT TGCTCGACAT CGTGTTCCAG CACACCGGTC CCAACTGGCG CTACCCGCCC
GATGTTCCCG GTGGCGCAGA CATGCCGCGC TATACGAGCG GGCGCTACCC GTTCGGCAGT
TGGGTCGATG CTGCGGGTGC GCCGCTCGTG GGCATTCCTG ATGTGAACGA TGCTGCCTGG
CCCGAAGAGA TGCGCACGAT CGACTATTAT ACGCGCGCTG GCGCCGGCGA TCTGGGCGCT
GGCGCTATCG ATGATCCGGA TGCCGAGCAT AAACGGTCGG ACTTTTTCAC GCTGCGCGAC
ATCAATCTCG ATGCGCCGGG CGCGCTCACC GATCTGGCGC TGTGCTACAA ATACTGGATT
GCGTTGACCG ACTGCGATGG GTTTCGAATC GATACGCTCA AACATGTCTC ATTCGAGCAG
GCGCGCAATT TCTGCGGCAC GATCAAGGAG TTCGCCGCCA ACCTGGGCAA AGCGAACTTC
TTCCTGGTCG GCGAAGTCGC CGGGGGCGAT TTTGCCGCAA CACGCTACCT CGACGCGCTG
GAGCGCAACC TGAATGCCGC ACTCGATATC GGCGAAATGC GCCTGGCGCT CGGAGATGTT
GCAAAGGGGC TGGCGCCAGC GCGCGCCTAT TTCGACGGGT TCGTGCCGGG GCTGGCAATC
ATGGGGTCGC ACCGCAATCT CGGCAGTCGC CATATCTCAA TCCTCGACGA TCACGACCAC
GTTTTTGGAA CAAAACTCCG TTTCTCAACC GATGTGATGT CGCAGCATCA TGCGGCGGCA
GCAGTCGCAC TGCAACTCTT CACGCTCGGC ATTCCATGCA TCTATTACGG CACCGAACAG
GCGCTCGGCG GTCCTGAACC ATCGGAGCGA CAGTGGTTGC CGGAGTGGGG ACGCGCCGAC
CGCTACCTGC GCGAGGCGAT GTTCGGTCCA CTCCACCCGC GCGCGTCCGG TCGCGCCGGG
ATCGACCCCC AGGCGCTCGA TACATCGTTG CCAGGATTTG GACCTTTTGG CACTGCCGGG
CATCACTGCT TCGACGAGCG CTTTCCAGTC TACCTGCGCA TCGCGGCGCT GGCAGCCCTG
CGCGCCGCCT TCCCGGTGTT ACGCCACGGT CGCCAGTATC TGCGCCCGAT TTCAAACTTC
AACCAGCCAT TCGCATTCCC GCCAGCCGGA GAAATCGTCG CCTGGTCGCG CATCCTCGAT
GACGAGGAGG CGTTGTGCGT GATCAATCCG AATGGTCTGG CGGCACGTGG CGGCGATGTA
GTGGTCGATG CCGCACTGAA CCGCCCCGGT GATACCATGA CGGTCATCCT GAATACCGCC
CAGGCCGCTG ATCCAGACGG CTATGACGGT CTGTATCCCA AAGGACGGCA ATTGACGGTT
AGAGAGCGGA ATGGAACGTC GTATGTTGAA ATTCGCAACC TGCCGCCAGC CGAGACGCTG
GTGCTGACAA ACCGACCATA G
 
Protein sequence
MTFVRDMLSR PRPPRIRHDV QLPRRVAYYP SPVDWRDEVI YFLMVDRFSD GQEDTRPLLD 
RRYLAAARPA LPNGDPWRWD RWALSGGERF QGGTLRGIIS KLGYLQRLGI TTLWLSPVCK
QRVHLDTYHG YAIQDFLDVD PRFGTRQDLV DLVSAAHERG MRVLLDIVFQ HTGPNWRYPP
DVPGGADMPR YTSGRYPFGS WVDAAGAPLV GIPDVNDAAW PEEMRTIDYY TRAGAGDLGA
GAIDDPDAEH KRSDFFTLRD INLDAPGALT DLALCYKYWI ALTDCDGFRI DTLKHVSFEQ
ARNFCGTIKE FAANLGKANF FLVGEVAGGD FAATRYLDAL ERNLNAALDI GEMRLALGDV
AKGLAPARAY FDGFVPGLAI MGSHRNLGSR HISILDDHDH VFGTKLRFST DVMSQHHAAA
AVALQLFTLG IPCIYYGTEQ ALGGPEPSER QWLPEWGRAD RYLREAMFGP LHPRASGRAG
IDPQALDTSL PGFGPFGTAG HHCFDERFPV YLRIAALAAL RAAFPVLRHG RQYLRPISNF
NQPFAFPPAG EIVAWSRILD DEEALCVINP NGLAARGGDV VVDAALNRPG DTMTVILNTA
QAADPDGYDG LYPKGRQLTV RERNGTSYVE IRNLPPAETL VLTNRP