Gene Rcas_3676 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3676 
Symbol 
ID5541178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4807955 
End bp4810078 
Gene Length2124 bp 
Protein Length707 aa 
Translation table11 
GC content60% 
IMG OID640895796 
Producthypothetical protein 
Protein accessionYP_001433743 
Protein GI156743614 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACATTC TGTTATCGAT CCGGGTTGTC GGAGGTATTG TCGGTTTTTG GATCTGCTTC 
TTTCTCGGTA GTCGCACCGC ATATCATCCC TCAATGCGGT GGTTGCGCTG GTCATTTGTC
TTTGCCGGTT TCCATTTCCT CTTTGTCGCG CTTGAGTCTC TGCTCACATC CACGACATCC
ACCGCTATTT CACTGCTCTG CTGCGCAACA GGCCTGATCG CCATCGCCTT CTGGCATGGA
TGGCTCGTCG GAGTGCGCGG GCTTGCGCGC ACCGAGCGCA TCGTTCGCAA GGTTCACCTC
GGCGTGGCGG TCGATTACGC TCTGGTTGGC GTGGTTATCA GCCTCATTCC CGCAGATGGG
ACCTTCATCG CGCCAATCAT CGATCCCCTG GTTGCGCCGG TTCTTATTGG CGTCTACCTG
GCAGGCGCGC TCTCTCATAT CTGGGTGATG ACCTGGCGAC TGCATCATCG CGAAACGATC
CCGGTCGCGT GCGCAGCATC CAGAGCGGTG CTGATCGCAA CGACGATCAT CGGTGTGAGC
GTCATCGCGG TTGCCGCCGG TGCAGCACTG CGGCAATCGC TGCCAGAGGT CGTCCTCCTG
ATCGAGCATG CGGGGTATGG AGCGCTCAGT GGCGGCATGG CGCTGATGGG GTATGGCGCA
CAGAGTTATA CCGAACGCCA TATGGGACGA ACCAGAGGTC GTGATTACCT GTTCAGCGGG
ATTGCCAGCC TGGGTATCGT GGGCGTGTAT GAGGGTGTCC TGATCGCATT TTGGCTGGCG
ACATCGAAGG TCTGGCCAGC CCAGCAAGCG CTTGTTCTGG GGATCATCCT GATCCCTGTG
ATTGTTGCCA CACACTTCGG GTTCGATCAA CTGCGCGATA TGCTCGACTG GTTACGTTTC
GGCGCATCGG TGCGGCGGGT GCGCGGCACG TTGCGCACGA TCACGCGACA GATCGGCTCT
GCGAAGCCGC GCGATCAGGT GCTGCGCGAT GTGCTACAAG CGTTGGCGCA TGCGCTGCGA
GCGGAACGCA CTGCCATCTT CTGGTTCAAG CGGAGTGAAG CGCATCTGCT TGCCGCATAT
GGTCAGAAGC CGGACTCGCC GGTGCAGGCG GACGCTCTCT GCACCGTCCG GCTGAAACCG
CTCAACGGAT TTGCTGGCTA TGAGGACGTT CTGCCGCTCT GCACCGGTCG CAAACAGCAC
GGCGCCCTCC TGATTGGCGG TTCGGACTGC CGGCGGTGGA CGCTGGATGA ACGCGAACGC
CTGGAGGCGA TTGGGGTGCT GCTTGCCAAT TATATCGCGC ACACGGCGTC TGAACCGGTC
ACCATTCCGC AGCATCTTCA CCGTCTCGAA GAGCAGACGC GCGACGTGCA GGCGCTGCAC
GCCACACTGG ACCAAGGGCG CTCTCCATCG GTCTTCATCA CGACGCTCGG CTCATTTCAG
GTCGAGGTGC GGGGGCAACC GGCATCGTAC AAGGCTGTGC GTGTTGGCCG CCACATGCTC
AATGGCATGC TTATGTATCT GGTCGCCAAT GTGGACAAAA CTGTCAGGCG CGATGCGCTG
ATCGAAATTG CCCTCGATCA TCGCCGCGGT CGAAAGCCGG ACGATGTCTC GTCGCCGGAC
GGCGCGCACT ATATTTCCGG GTTGCGCAAG ATTCTGGAGC GCTGGGGCAT GGCGGATGCG
TTGGAAATCA GCGACACGAC GGTGATACTG AAGCGGCATC CATCGTGGAC TACCGATACG
GATCAGGTGG TCGAACGGTA TTGTCGCGCA AAACAAGAAA TCGCCAACGA TCGGATTGAT
CGTGCCATCC TGTGGCTCAA AGAGGCGCGC GCGCTGTTCA AGGGCGATTA CCTGCCGGAT
TTCGATGCTG CCGATTATCG TATTGAGGAG ACGCTCAAAT GGGAGCGTGA ACACGCAGAG
ATTGAACGTC TCCTCCTCAG ATGCTACGCA GATTGTCCAG ATGACGCCAT CAAGCACGAG
GCGCTCGACA CAGCGCACTC GATTCTCAGT CGGTATGAAG ACGATGGCGA CATGCTGCGC
AGCATCGAGC GTGTCGCGCA GCGCTTCCAG GATCACCGGC TGCTTCAGCG GTGTCGAGCG
CTCCTGTCGG ATTGCGCCAC ATAA
 
Protein sequence
MDILLSIRVV GGIVGFWICF FLGSRTAYHP SMRWLRWSFV FAGFHFLFVA LESLLTSTTS 
TAISLLCCAT GLIAIAFWHG WLVGVRGLAR TERIVRKVHL GVAVDYALVG VVISLIPADG
TFIAPIIDPL VAPVLIGVYL AGALSHIWVM TWRLHHRETI PVACAASRAV LIATTIIGVS
VIAVAAGAAL RQSLPEVVLL IEHAGYGALS GGMALMGYGA QSYTERHMGR TRGRDYLFSG
IASLGIVGVY EGVLIAFWLA TSKVWPAQQA LVLGIILIPV IVATHFGFDQ LRDMLDWLRF
GASVRRVRGT LRTITRQIGS AKPRDQVLRD VLQALAHALR AERTAIFWFK RSEAHLLAAY
GQKPDSPVQA DALCTVRLKP LNGFAGYEDV LPLCTGRKQH GALLIGGSDC RRWTLDERER
LEAIGVLLAN YIAHTASEPV TIPQHLHRLE EQTRDVQALH ATLDQGRSPS VFITTLGSFQ
VEVRGQPASY KAVRVGRHML NGMLMYLVAN VDKTVRRDAL IEIALDHRRG RKPDDVSSPD
GAHYISGLRK ILERWGMADA LEISDTTVIL KRHPSWTTDT DQVVERYCRA KQEIANDRID
RAILWLKEAR ALFKGDYLPD FDAADYRIEE TLKWEREHAE IERLLLRCYA DCPDDAIKHE
ALDTAHSILS RYEDDGDMLR SIERVAQRFQ DHRLLQRCRA LLSDCAT