Gene RoseRS_4021 details


Gene Information

Locus tag: RoseRS_4021
Symbol:
ID: 5211004
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5030899
End bp: 5032335
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 57%
IMG OID: 640597610
Product: hypothetical protein
Protein accession: YP_001278316
Protein GI: 148658111
COG category:
COG ID:
TIGRFAM ID:
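
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 5030899
END_BP = 5032335
REPORTED_GENE_LENGTH = 1437     # bp
REPORTED_PROTEIN_LENGTH = 478   # aa


def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length):
    """Residue count for a complete CDS: all codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1437 478
```

Both computed values agree with the reported fields, which is consistent with the inclusive-coordinate assumption.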


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTACCC TCAAGTCCCT TGCTCAATCT GCCATGCTGG GTTCGACCGA CGAGTCGCTG 
CTCGCTTCGG CGATCCGTGT GGGGATCGCT GAATTGGGAG GATATGTCCC GGTAGCTTTT
ACAGGAGAAA GGGCAATCTG TGGCACAGAA AGCCGGCCGC GAATGCCGGA GAGAGCGGGG
AGATTGTTCA AACGCATGCT GGAAGAAGAA ATGGATGAAC TGCTACCGGA GTTTTTGCAA
GGGGCCGCCG AACGCGGCTA TATCGTTCCC CCCGAAACCT TGCCTGCGCT CCTGAGTCCT
GGCAGAAACG CATGGCATTC CCTGATCCTC CCCGTCATCG GTGAACGCGG GAGCTGGCTG
GCCGCGCACA ATCCAGCCTG GGACTATGCT CGCAAACGTG AGCCACTAGA AGCCTGGGAA
AACGGAACTC GCGCCGAGCG GGTCTTCGCC CTGGAACGCA TACGTACCAC TGACCCGGCC
AGAGGGCGCG AATGGGTACA GACCTGTTGG GAAACAGATT CACCCGAAGA CCGTGCCGCG
TTCCTGGCTA CCTTTGCGAT CGGATTGAGC ATGGACGATG AACCCTTTCT CGAAGCCTGC
CTTGATGACA AACGCAAAGA AGTGCGCCAG GCGGCGCGCA GATTGCTCCT GCGGCTGGAA
ACATCACGCT TCGTGAAACG GATGTGGACG CGGGTCAAGC TGATCGTTCG CGTCCGCTCC
TCTTTTCTAA ACAAGGGAAC GTTGGAAGTG ACTTTGCCCG AAGAACTGGA TTCCGCTGCA
AAACGGGACG GTGTGGGTGA GCTTGCGTTG CCCAAAAAGA TGGGTGAAAA AGCCAGCAGA
CTGGCGCAAC TGATCGCACT GACCCCGCCA GTGTTATGGA GCCGTGAATT TAACCGTTCA
CCTGACCGAT GGATCGCCAT GGCGCTCACC TGCGAATGGA AAGAGCCGCT CCTTCTCGGC
TGGCAAATGG CTGCCATCGG AACAGCAGAT GCAGACTGGG CCGAGTCGCT GATCCTGCTA
TGGATGACAC AGGAGGAGGC ACAATCTTTC CTGGATATGA ACGCGCTTTC GGCACTGGTT
CCTCTCCTGC GCGCCGAAAA AATTGAGGGG TGGGTCACCT CTATCGAACC CGGTGTCAGC
GATACCCGCA GCACAAGAGC AATCGCGTTG TTAGAAATGT ACCATCGGCC ATGGACAGCA
AACCTCTCGC GCTGGGTCGT GAAGAGCGTA CAGCAGCAAT CCACCGTTTT CCATCAGCGC
TTACTCCAGG CGCTACCCGG CTTTGCCCGC TGGATGCCTC CTGAACTGTC CGATGAATTT
TCGCAGGGTT GGGAAGATGA ACCTGGAAGC GGCTGGAACA AGAAAATCGG GTCCTTTTTG
CAAATCCTGA AGTTCCGTAA CGAGATCAAA TCCAGCCTGG AGGAACAATC ATCATGA
 
Protein sequence
MTTLKSLAQS AMLGSTDESL LASAIRVGIA ELGGYVPVAF TGERAICGTE SRPRMPERAG 
RLFKRMLEEE MDELLPEFLQ GAAERGYIVP PETLPALLSP GRNAWHSLIL PVIGERGSWL
AAHNPAWDYA RKREPLEAWE NGTRAERVFA LERIRTTDPA RGREWVQTCW ETDSPEDRAA
FLATFAIGLS MDDEPFLEAC LDDKRKEVRQ AARRLLLRLE TSRFVKRMWT RVKLIVRVRS
SFLNKGTLEV TLPEELDSAA KRDGVGELAL PKKMGEKASR LAQLIALTPP VLWSREFNRS
PDRWIAMALT CEWKEPLLLG WQMAAIGTAD ADWAESLILL WMTQEEAQSF LDMNALSALV
PLLRAEKIEG WVTSIEPGVS DTRSTRAIAL LEMYHRPWTA NLSRWVVKSV QQQSTVFHQR
LLQALPGFAR WMPPELSDEF SQGWEDEPGS GWNKKIGSFL QILKFRNEIK SSLEEQSS
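
The gene and protein sequences above are linked by translation table 11, and the GC content field is a simple base count. The sketch below shows one way to reproduce both from the gene sequence, assuming the space-separated 10-base blocks are joined first. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The amino-acid string is the standard codon assignment, which table 11 shares with table 1 (the two differ only in permitted start codons).

```python
# Codon table built from the NCBI base order T, C, A, G.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACTACCC TCAAGTCCCT TGCTCAATCT"))  # prints: MTTLKSLAQS
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` should allow a comparison with the reported GC content and protein sequence.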