Gene Rcas_4423 details


Gene Information

Locus tag: Rcas_4423
Symbol:
ID: 5541936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5686364
End bp: 5687617
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 59%
IMG OID: 640896521
Product: hypothetical protein
Protein accession: YP_001434457
Protein GI: 156744328
COG category: [S] Function unknown
COG ID: [COG2311] Predicted membrane protein
TIGRFAM ID:
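
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5686364, 5687617)
print(length, protein_length_aa(length))  # prints: 1254 417
```

Both values match the Gene Length and Protein Length fields of this record.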


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00407884
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.808497
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCACG CAATAGCGCC AACCACAGCA CGTGAGCGAA TCGACCTGCT CGACATCCTG 
CGCGGATTTG CGCTGTTGGG TATCCTGATC GTCAACATGG GCATCTTCAG TTTTCCGTTC
ATTGCAGCAT TCACCGGAAC GCCACGCGGA GAAAGCACAT TCGACCATGC CGTTGAGTTT
CTTACCCATG CACTGGCGAC GGGCAAGTTC TACCCGCTCT TCTCCTTCCT TTTTGGGTTG
GGGATGTGGT TACAGATGGA GCGCGTGCAG GAAGCCGGTG GCGCCCCGGC GCGCTTTATG
GTCCGCCGGT TGTTGGTGTT AATGGGATTT GGCCTGGCGC ATGCCCTGCT GATCTGGAAT
GGCGATATTC TGTTCATCTA TGCGCTGGTT GGTCTGGTTG CCCTGCTGTT TCGCAAGGCG
CAACCGCGAA CGCTTCTCAT CTGGGCGAGC GCGCTGATCG CCATACCGAT CATCTTGAGC
GCAGGCCTCA TCGTTCTGGG CATTCTGGTG GCAGGTCTTG CTCCAACCAA TGCAAGCGGC
ATGGACGAGG TCATGACGCT CGTGCGCGAT CTGGAGCGGC AGGCTATCGA GACCTATGCG
CGGGGATCGT GGGGGCAAAT CTTTGCCTGG CGCGCCATTG AGTGGCTGAT CGTCCTGGTA
TTCTTCTTCT TGAGCGGAAA TGTCCTCCAG ATCCTGGCAA TCTTTCTGAT CGGCATGTAT
GCCGGCAAGC GCCAGGTGCT CCAGCGCCTG ATGCAGTTGC CGGCAAACGA GCGTCGCCTG
CCAGCCGGGA GGGTGTGCCT GGTCGTGGGG TTGGTCGCCA ATTTCGCGCT GACCTGGCTG
ATGTGGACGG TGGATATGAC ATCGCCGCTG GCGGGATTGC CGTCGGTCCT TCTGCTTATC
TTTGGTCCAG TGCTGAGTTA CGGTTACATG GCGGCATTCG TGGCGCTGAC CCGCCGGGAA
GCCTGGCATC GCCGCCTGGA ACCTCTGGCA GCCGCAGGAC GAATGGCGTT GAGCAACTAC
ATCGCACAAT CGATTGTCTG CACGCTCATC TTCTACAGTT ACGGATTGGG GCTGTTCGGT
CAGGTGGGTG CGTTCGCCGG GCTGCTGATT AGCCTGACCA TCTGGCTGGT CCAACTGGTT
ATCAGCGTCT TCTGGCTGAA GCGCTTCCGG TTTGGTCCGC TGGAGTGGGT CTGGCGCAGC
CTGACCTATG GCGCGCCGCA GACTATGGCG AAGACGCGTC AGTTGGCGGC GTGA
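
A GC percentage such as the one listed under Gene Information can be recomputed from a sequence printed in 10-base blocks. The following is a minimal sketch. `gc_fraction` is a hypothetical helper, and it assumes the only non-base characters in the input are whitespace.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the spaces and newlines between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 10-base block of the gene sequence above: 3 G + 3 C out of 10 bases.
print(gc_fraction("ATGTCGCACG"))  # prints: 0.6
```

Passing the full gene sequence, line breaks included, would give the overall fraction to compare against the reported GC content.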
 
Protein sequence
MSHAIAPTTA RERIDLLDIL RGFALLGILI VNMGIFSFPF IAAFTGTPRG ESTFDHAVEF 
LTHALATGKF YPLFSFLFGL GMWLQMERVQ EAGGAPARFM VRRLLVLMGF GLAHALLIWN
GDILFIYALV GLVALLFRKA QPRTLLIWAS ALIAIPIILS AGLIVLGILV AGLAPTNASG
MDEVMTLVRD LERQAIETYA RGSWGQIFAW RAIEWLIVLV FFFLSGNVLQ ILAIFLIGMY
AGKRQVLQRL MQLPANERRL PAGRVCLVVG LVANFALTWL MWTVDMTSPL AGLPSVLLLI
FGPVLSYGYM AAFVALTRRE AWHRRLEPLA AAGRMALSNY IAQSIVCTLI FYSYGLGLFG
QVGAFAGLLI SLTIWLVQLV ISVFWLKRFR FGPLEWVWRS LTYGAPQTMA KTRQLAA
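
The protein sequence above should correspond to the gene sequence translated with translation table 11. A minimal sketch follows, assuming the codon-to-amino-acid assignments of the standard code (which table 11 shares; the two differ in their permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical, and alternative start codons are not handled, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence (10 codons).
print(translate("ATGTCGCACG CAATAGCGCC AACCACAGCA"))  # prints: MSHAIAPTTA
```

The result matches the first block of the protein sequence listed above.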