Gene Rcas_3943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3943 
Symbol 
ID5541449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5149413 
End bp5151518 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content60% 
IMG OID640896051 
Producthypothetical protein 
Protein accessionYP_001433994 
Protein GI156743865 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.627238 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.614013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCTT TTGATAGTGC GCTGAAGAGC AATGTGCTCC GCTCCGGCGG GGCGACGACG 
CGATACGCCG TCATTCCGGC GCTACTGGCG GTCGGGTGGC TGGCAGCGAT TGCCGTCTTT
GGTTCGTCCG ACGTTCTTGT CTGGGCGCTT TCGAGCGCCG GAAGGCTGGC ATTGCCGGCA
GGCGCAGCGC TGGCGCTCTG GATTGTTCCA GGGATGGCAT TGATCAGCCT GCTCTGGCGC
GACCATACCT TGAGCGTCAT CGAGCGCATT GGCGTGGCGT GGGGGATCGG AGCAGCGCTG
CCGCCGATCC TGTTGCTTCT GGCGGATCTT CTCGATCTGC CCTGGAATCG TCTCACGACG
ATCATGTATG TGGTTCTTGC CGTGAGTGTC TGGCTGTTTG CAGTGTGGCG CAACACAAAG
GCGTCAGAAC GCGCGCCACA GCGTGATCCG GCGAACGGAC GGATTGCTCA AGGCTATATC
GTCCCCCTGA TGATCGGTCT CGTGGCGCTG GCGCTGATTG CCCGTCTCTA TACCACGCGC
GAGATGCTGG TTGGCTCCAA TGTGGACAGC TATCATCACA CGCTGATTGT GCAACTGCTG
GTCGAGCGAC AGGGGTTGTT CCAGTCATGG GAGCCGTATG CGCCACTGGC AACCCTGACG
TATCACTACG GCTTCCATGC CAATGCGGCA TTCGTTTCGT GGCTCACCGG CATCCCGGCG
ACACGCGCTG TCGTTGAGGT CGGGCAAATA ATGAATGCAG CCACGCTTCT CACTGTCTTT
GCATTGACTG TCCGCTTGAC CGGCAGCGTG ATCGCTGGCG TATGGGCGGC GCTGATTGTC
GGGTTCTATA ATACGTTGCC GGCCTTTTTG ACGTTTTGGG GGCGCAATCC TTTCGTTACG
AGCCACGTTA TTCTCGGCGC TGTGCTGATC GTCTGGATAG CAGCCATCGA GTCGCCGCGC
CACAATTGGC GATTGCTGGC GTTGGCAGGG ATCGTGAGCG CCGGGCTGTC GTTGAGTCAT
TATCAGACAA CCATCCTGGC AGCGCCGATG ATCCTGGTCT CTCTGCTGAC GCTGCGTTTG
CACGCTTCAC CCGGCATGAT GCTGGCGACT CTGGGACACG CCGCGACGAT TGGCGCTGTC
GCGCTTCTGC TGACGCTGCC ATGGCTGCTG CACGTCTCGA GTGGCTATCT GGATCGAAAT
GTGGTGCATG CCATGCGCCC CGAAGCCGCA GCCGGGCAGA TTCTGGCGGC AGCGGTACCC
GCTCTTGCAC CGCTTTATCT CAAAGGTCCG GTCATTGCAG CCGCGCTGAT TGGTCTGGCG
CTGGCAGCGC GACGACGCTC CTGGCATGCG CTCCTGCCCG CCGGGTGGGT GCTGGCGGCT
CTGGCGACCG CGATGCCGCA TCTGCTCGGA TTGCCCGGCA TCGGAGTGGT TCAGGGAGAT
GTTGCCGCTA TGATTCTCTA CCTGATGGCA ACGCCGCTGG CAGGCGTCAC ACTGGCATCG
ATCGGCGAAG TGCTGGAACG GGTTGCACCA CGACTGGCGA TCATTCCCAT CGCAACCGCC
ATCATCCTCG TCAGCGCCTG GGGTGTTGGA TGGCAACGCG ATCTGGTTCC AGCGTATATG
CGAATGGTTA CGCCAGCGGA TATGCAGGCA ATGGAATGGG TGCGCGCGAA TACGCCGCCG
GCTGCGCGTT TCATTGTCAA CAGTCATCCG ATCTATGGCG GTGATATGAT CGTCGGAACG
GACGCCGGAT GGTGGTTGCC GTTCTTTGCC GGACGACAGA CGAATGTGCC GCCGATGACA
TATGGGAGCG AGTTGAGCGT CGATCCGCAG TATGCATTAG ACATCAACGC ACTGGCGCGC
GACCTGCGAC GCCGACCGTT GACCGATGGC CGTCCTGTGG TGATTGATCT GACACTTCCT
GAAACGATTG ATCGGTTACG TAGTGCTGGC TATACCTTCG TCTATAGCGG CGCGCAATCG
ATTGCCGGTC CCGGCGGCAT TCCGGCGCCA GACCGTATCG ATACCGCCAG ATTGCGTACA
AGTCCGCACT TCCGCCTGGT GTACGACCGG GACGGCGTTG AAATCTTCGA GTTGGTGACG
CAATGA
 
Protein sequence
MKPFDSALKS NVLRSGGATT RYAVIPALLA VGWLAAIAVF GSSDVLVWAL SSAGRLALPA 
GAALALWIVP GMALISLLWR DHTLSVIERI GVAWGIGAAL PPILLLLADL LDLPWNRLTT
IMYVVLAVSV WLFAVWRNTK ASERAPQRDP ANGRIAQGYI VPLMIGLVAL ALIARLYTTR
EMLVGSNVDS YHHTLIVQLL VERQGLFQSW EPYAPLATLT YHYGFHANAA FVSWLTGIPA
TRAVVEVGQI MNAATLLTVF ALTVRLTGSV IAGVWAALIV GFYNTLPAFL TFWGRNPFVT
SHVILGAVLI VWIAAIESPR HNWRLLALAG IVSAGLSLSH YQTTILAAPM ILVSLLTLRL
HASPGMMLAT LGHAATIGAV ALLLTLPWLL HVSSGYLDRN VVHAMRPEAA AGQILAAAVP
ALAPLYLKGP VIAAALIGLA LAARRRSWHA LLPAGWVLAA LATAMPHLLG LPGIGVVQGD
VAAMILYLMA TPLAGVTLAS IGEVLERVAP RLAIIPIATA IILVSAWGVG WQRDLVPAYM
RMVTPADMQA MEWVRANTPP AARFIVNSHP IYGGDMIVGT DAGWWLPFFA GRQTNVPPMT
YGSELSVDPQ YALDINALAR DLRRRPLTDG RPVVIDLTLP ETIDRLRSAG YTFVYSGAQS
IAGPGGIPAP DRIDTARLRT SPHFRLVYDR DGVEIFELVT Q