Gene Rcas_3820 details

Gene Information

Locus tag: Rcas_3820
Symbol:
ID: 5541323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4991048
End bp: 4992982
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 62%
IMG OID: 640895930
Product: hypothetical protein
Protein accession: YP_001433876
Protein GI: 156743747
COG category:
COG ID:
TIGRFAM ID: [TIGR02226] N-terminal double-transmembrane domain
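
The start, end, gene length, and protein length fields in this record agree with one another, which a short calculation can confirm. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4991048, 4992982)
print(length, protein_length_aa(length))  # 1935 644
```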


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.095196
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTTTC TGACGCCGCT GGCGCTTCTC AGTGCACTTG TTGTGGGTCC GCTGATCGTG 
GCGATGTATC TGTTGAAACT TCGCCGCGAG GAACTGCGCG TCTCTTCGAC TTTCCTCTGG
CAGCGCATGG TGCGCGATGT GGAGGCAAAC GCGCCCTGGC AGCGCCTGCG GCGCAACTGG
CTGCTCTTCC TGCAACTGCT GCTGCTGCTC CTGTTGGCAA TCGCGCTGGC GCGACCCTTC
TTGCTTACCA CCGGCATCAG CGGGCGTAAC CTGATTATCA TCATCGATCG CTCGGCAAGT
ATGGCGGCGA CGGACGTCCC CCCCTCGCGG CTCGAAGCGG CGCGCCGCCA GGCGCAGACG
CTGGTCGATC AGTTGCCCGA AGGCGGGCGC GCGACGATTA TTGCCATCGG CGGGCAGATG
GACGTGCTTG CCGCTTCAAC GACGGATCGC CGCCAGATGT ATGATGCCAT TCGCGCGACG
ACGCTCAGCA TTGGTGGTCG TGGCGATTTG TCGCAAGCGC TGGCGCTTGC CACCGCTCTC
GCGGCGCGTG AACCGGATAG CGAGGTTGCC ATCATTTCCG ACGGCAATGT CGAGACTCCA
ACCGACATCC GTGTTCCGGC GACGGTGCGC TATTTTCCCA TCGGTCAACG CGCGGAGAAT
GTCGCTATCA GCGCTATGGC GCTGCAACCG ACACCCGCCG GACAGACGCT GTTTGTTCAG
GTCTCTGGCT ATGGCCCGGC GCCGGTTTCG CGGCGGCTTG ACCTCTACCT CGATGGCGCA
CTGTTCAATG CATACGAACT CAACCTCGGA CCAGACGGCA CTCCAGACGC TGTCCAGACG
GTGATCGTCG ATATTCCTGC TCAGGCGCGC GTTGCCGAGG CGCGACTCAG TCCGGCGCCC
AATGACGATT TCTTGCCCTC CGATGATCGG GCATGGGCAG TAAGTTCGAC GGGCGCAGGC
ATGGAGGTGC GTATTGTTGG TCCTGGCAAC CGCTTCCTCG AAACGGCGCT CTCGTTGTTG
CCCGGCATCA CTGCCACCAA AACAACGACC ACGACGGTTT CTGGCGATAC TGCACCACAG
GTGACAATCT TCGATCGGGT TGTGCCGGAA GCGCTGCCGA CCGGCAATCT GTTGTTCATT
GCTCCGATGC GCTCGACCCC CCTCTTTTCT GTGACCGGCA TGGTTGAATT TCCGCTGCTG
CGCCCGGCGC CGATCGTAAT CGAAGGGCAA GCGCCGCCAC TGCTGCGGAA TGTCAGTGTG
AGCGAGGTGA ATGTGCTGCG CGCGATGCGC ATCGAGACAG GCGTGTGGGC GCGCGCGCTG
GTCGAAGGAG ATGGCAGCCC AATGCTCCTG GCGGGGGAAC GCGAGGGGCG ACGCATTGTT
ATCCTGGCAT TTGCGTTGCA AGACTCCGAT CTGCCGCTTC AGGTTGCCTT TCCGCTGTTG
ATCTCGAATA TCATCGGGTA TCTCGCGCCG GGAAGCGGTC TGGAAGCATC GCAGATCGCT
CCCGGGCAAC CGCTGGTCGT GGCAGTTGAT CCCGCTGCCA CAGCGGTGCG TGTCGTTCGT
CCCGATGGGC GCGTCGATGC GGCACAGATT CAGGGTGGGC AGGCAATCTA TGCCGATACT
GATGCGCTCG GACCGTACCT CATCGAGCAG GTGCGCGATA ATCAGGCAGT CGAGCAGCGG
CGTTTCGCTA TCAATCTGTT TGCGCCGGAG GAGTCGCGCA TTGCACCGTC AGGTGAGTTA
CGCGTGCCAC AAGTCAGTGG TTTGCAACAG GCGGTGACCC GCGAGCAGGT GGGACGACAG
GAACTCTGGC GCTGGCTGGC GGCTGCGGCA ATCCTGATCG TTCTTATCGA ATGGCTGGTG
TACCAGCGCA GCAGTCTGGC GTACCTGCGG CAGCGCGTCC GTCTTGCGCT CGCAGCGCGT
CGCCATCCGG CGTAG
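
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before computing a length or a GC percentage. The sketch below shows one way to do that, using only the first line of the sequence as sample input. The helper names are hypothetical, and the record does not state how its own GC content figure was calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listed above, used as example input.
first_line = "ATGTCGTTTC TGACGCCGCT GGCGCTTCTC AGTGCACTTG TTGTGGGTCC GCTGATCGTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions over the complete sequence gives values to compare with the Gene Length and GC content fields.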
 
Protein sequence
MSFLTPLALL SALVVGPLIV AMYLLKLRRE ELRVSSTFLW QRMVRDVEAN APWQRLRRNW 
LLFLQLLLLL LLAIALARPF LLTTGISGRN LIIIIDRSAS MAATDVPPSR LEAARRQAQT
LVDQLPEGGR ATIIAIGGQM DVLAASTTDR RQMYDAIRAT TLSIGGRGDL SQALALATAL
AAREPDSEVA IISDGNVETP TDIRVPATVR YFPIGQRAEN VAISAMALQP TPAGQTLFVQ
VSGYGPAPVS RRLDLYLDGA LFNAYELNLG PDGTPDAVQT VIVDIPAQAR VAEARLSPAP
NDDFLPSDDR AWAVSSTGAG MEVRIVGPGN RFLETALSLL PGITATKTTT TTVSGDTAPQ
VTIFDRVVPE ALPTGNLLFI APMRSTPLFS VTGMVEFPLL RPAPIVIEGQ APPLLRNVSV
SEVNVLRAMR IETGVWARAL VEGDGSPMLL AGEREGRRIV ILAFALQDSD LPLQVAFPLL
ISNIIGYLAP GSGLEASQIA PGQPLVVAVD PAATAVRVVR PDGRVDAAQI QGGQAIYADT
DALGPYLIEQ VRDNQAVEQR RFAINLFAPE ESRIAPSGEL RVPQVSGLQQ AVTREQVGRQ
ELWRWLAAAA ILIVLIEWLV YQRSSLAYLR QRVRLALAAR RHPA
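
The protein sequence can be re-derived from the gene sequence by codon translation. This is a minimal sketch, assuming the listed gene sequence is already in coding orientation. It builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The names are illustrative, and any base other than A, C, G, or T raises a `KeyError`.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # remove block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and the final 15 bases of the gene sequence listed above.
print(translate("ATGTCGTTTC TGACGCCGCT GGCGCTTCTC"))  # MSFLTPLALL
print(translate("CGCCATCCGG CGTAG"))                  # RHPA
```

The two outputs match the first ten and last four residues of the protein sequence shown above.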