Gene Rcas_4420 details


Gene Information

Locus tag: Rcas_4420
Symbol:
ID: 5541933
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5680114
End bp: 5681841
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 60%
IMG OID: 640896518
Product: hypothetical protein
Protein accession: YP_001434454
Protein GI: 156744325
COG category:
COG ID:
TIGRFAM ID:
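
The start, end, and length values in this record agree with each other. A minimal sketch of that check follows. It assumes the coordinates are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are hypothetical, chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5680114, 5681841)
print(length)                  # 1728
print(protein_length(length))  # 575
```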


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.187576
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.765326
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCCGCA AAAACACCCG TCACTGGCAA ATTCCGCTCA CCAGCCATGT GGATGAACTG 
CCCGAACCGC TGCGCCGCCG CCTGGAGAAC TCCTGGGCCG GCACCTTCTA CCGCGAATTC
TTCTCCCGTC TGGACGAAGG CCCCTTTGCC GTGTTGTACA GCGACCTGAT TTCACGTCCC
AACGTGCCGG TGAATGTGCT GGTCGGGCTG GAATTTCTCA AAGCCGCCAA CGGCTGGACG
GATGAAGAGA TGTACGATCA CTTCTGTTAT GACGTCCAGG TGCGCTATGC CCTGGGCTAC
CGCCAATTGA GCGAAGGCTG CTTCGACTTG CGCACGCTGT ATTATTTTCG AGAACGGCTG
GCACAACATG CCCAGGAGAC GGGCGAAAAC CTGCTGGAAC GCGCCTTTGA GCAAGTGACG
GGCGAACAGC TGCGCGCCTT CTCCATCAAG AGCGGTAAAC AACGCATGGA CAGCACCCTG
CTGGCCTCCA ACATCCGCCA GATGGGGCGC ATCCAGCTGC TGGTGACAGT GCTGCAGCGC
GTCTGGCGCA TGCTCAGCGA AGCAGACCAG CAGCGCTATG CCCAACGCTT CGAGCGATAC
ACCAAAGGCC ACCCCGGGCA GTATATCTAT CGGCTGAAGA AGGAAGAGTG GCCGGAGCAT
CTGCAGCGCA TCGGGGAAGA CATGCGCACC CTGTTGCAGG AACTGCAGGC CGCCTACGGA
GAACAGCCGA CCTATGCGGT GCTGGCGCGG GTGTTTGCCG AGCATTTCCG CCTGGAGAAG
GAGAAACTAC AGGTCAAAGA AGCCAGCGAA TTGAGCGCCC GCAGCCTGCA ATCGCCGGAC
GACCTGGAAG CCACCTACCG CGAAAAGCAC GGCAAATCTT CACGCGGGTA TGTGGTCAAC
CTCACCGAAA CCTGCGACCC GGACAACCCG CTGCAACTGG TGACCAAAAT CCAGGTCGCG
CCCAATGTCA CCGATGACAG CGCCCTGCTG GCCGAAGCCT TGCCCGACCT GAAGGAACGC
ACCGGGCTGG AAGAACTCTA CACCGATGGC GCCTACGGCA GCGCCGAGAA CGATAAGCGT
TTGGCTGAAC AGGAGGTGAC GCTGATCCAG AGCGCCATCC GCGGGCGCAA GCGAAAGGAA
GAGCGGCTGT ACCTGGATGA TTTTACGCTG CCAGGCGACA CTCACAACGG AGCGCTGAAC
CTGACCTGTC CACACGCTCA GCAAGCGCCC GTGAGAGGCG CCAAACAGGG CAAATCGTAT
CGCGCCACCT TTGACGCGCA GGTTTGTGGG AACTGTCCCT TGCAGCCCCG GTGTCCGGTG
CAGCCTCGCA AAAATGGAGA AGCTGTACTG CTCTTCACGG AGGAAGACTT GCGCCGGGCG
CAGCGGCGGC GCAGGATGCG TCAGGCGGAC TCGGGAGAAC GGAACCTGCG TTCTGCCACT
GAAGCGAGCA TCCGCAGTCT CAAGCATCCC TTCCCGGCGG GCAAGTTACC GGTACGGGGA
CGTTTTCGAG CCGCCTGTCT GCTGATTGGT TCCGCCGCCG TGATGACCGT GCGGCGGATA
CACCGTTACC TGCAGAGCCA GATAGCAGGA AATCGGCCAG GAGAGCAGGC AAAAAGGATG
ACAAAACGCC TGGCAGAACA GGCGGAACAT GTTTTTTTTT TTGGCCGGAC GCTTTTGCAG
GCCTTTGGAC TTTACCGCCG AATCAACAGC CCGGTTTTGA CCTGGTAA
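
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation. The sketch below strips the whitespace and computes a GC percentage. The helper names are hypothetical, and only the first line of the sequence is used as sample input. The record's reported GC content of 60% applies to the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and newlines from block-formatted sequence text."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGTTCCGCA AAAACACCCG TCACTGGCAA ATTCCGCTCA CCAGCCATGT GGATGAACTG"
fragment = clean_sequence(first_line)
print(len(fragment))  # 60
print(round(gc_percent(fragment), 1))
```

To reproduce the record's figure, pass all of the gene sequence lines, joined, to `clean_sequence` before calling `gc_percent`.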
 
Protein sequence
MFRKNTRHWQ IPLTSHVDEL PEPLRRRLEN SWAGTFYREF FSRLDEGPFA VLYSDLISRP 
NVPVNVLVGL EFLKAANGWT DEEMYDHFCY DVQVRYALGY RQLSEGCFDL RTLYYFRERL
AQHAQETGEN LLERAFEQVT GEQLRAFSIK SGKQRMDSTL LASNIRQMGR IQLLVTVLQR
VWRMLSEADQ QRYAQRFERY TKGHPGQYIY RLKKEEWPEH LQRIGEDMRT LLQELQAAYG
EQPTYAVLAR VFAEHFRLEK EKLQVKEASE LSARSLQSPD DLEATYREKH GKSSRGYVVN
LTETCDPDNP LQLVTKIQVA PNVTDDSALL AEALPDLKER TGLEELYTDG AYGSAENDKR
LAEQEVTLIQ SAIRGRKRKE ERLYLDDFTL PGDTHNGALN LTCPHAQQAP VRGAKQGKSY
RATFDAQVCG NCPLQPRCPV QPRKNGEAVL LFTEEDLRRA QRRRRMRQAD SGERNLRSAT
EASIRSLKHP FPAGKLPVRG RFRAACLLIG SAAVMTVRRI HRYLQSQIAG NRPGEQAKRM
TKRLAEQAEH VFFFGRTLLQ AFGLYRRINS PVLTW
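
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is not the database's own pipeline; it builds a codon table from the NCBI base ordering (T, C, A, G). Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons it permits, so the standard assignments are assumed here. The `translate` helper is hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGTTCCGCA AAAACACCCG TCACTGGCAA"))  # MFRKNTRHWQ
```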