Gene Rcas_4068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4068 
Symbol 
ID5541579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5280673 
End bp5281971 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content64% 
IMG OID640896180 
Producthypothetical protein 
Protein accessionYP_001434118 
Protein GI156743989 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.233618 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTTGTA TCCAACTCCG TGCAGTCTGC CTGCTGCTGG CGCTGCTCCT GATGATGAGC 
GCTTGCGCCG GGTCGCCCGC CACCGTCCCC ACAACCATAG CGCCGACCGT TGCACCCGCA
GCCGCACCAA CCGCCGCGAC ACCGCCGACT GCCGCACCTG CGCCGACGGC GCCGCCTGCT
GCCACAGAGA TGCCGACGTC GGCAAATGCC GTCCTTCCCG CGCCGCTCTA CGTTCTTGAT
ACCGGGCAGA TTGTGCGTAT CGAGCGTGAC GGTGTCACAC GCAAACAGAT CACGAATGAG
GCGCCGCCTG CGCCGGATGC GCTGGCAATC GTCCAGTTCG ATGTGTCGCC GGTGGATGGA
ACGCTGGTGT ACCTTGTTCA GGGAATCGGT ACGCCGCCTG TACTGGTGCG CACCGACGCC
GACGGCGGCA ATCGTGCGAC GCTGCTTGAT AGCATGCCCG TCGCTGCGCC CGTGATTGCG
CCGAATGGCG CGACGATGGC GTTGCGCGTT TTCGAGGATT ATGAACGTCC AGGCACGTAC
ACGCCAGGTC TCTATCTGAT GCCGGTCGCT GGCGGCGAGC CACGCCTGAT CCTGGCGGAT
AAACCGGCGA CCGATCCCTC TATCGAAGGA GGCGATGGGC GTGGCTTCGA GGCGGTTGCG
TGGTCGCCGG ATGGCACGAA ACTGCTGGTG CACGCCTTCT CGCTCTCCGT CGAACTCTGT
GAACTGGCGA TCGTCGATGT GGCAAGCGGC GGCATCGTCT ATCTGGCGGC GCCTGAGCCG
AATCTGGTTG CAGCCTGCAC CGCCGCTGCA TGGACGCTCG ATAGCAAGGC GGTCTACTTC
AGCGTCGCCA ATCCGGGGAA GGGGTTCAAC GAGCCGGGCA TCTGGCGCGG CGATGCGATG
AGTGGTGCAG CGACCTCTGT GCCTATCGAA CCGTCAGATG CCCTGCTGCA TATGCCGTTT
TTCGCCATCG ACCGGTTGTA TGCGTTTGTG TCGTCGGCGC CTGGCGAGAA CCCGGTCTCC
TCACCTGCCG CCGATCCGGC GGAACTGATG GCGCTCTCAT ATACCATGAG CAGCGTGCCG
TTGACCGGCG GGGCATTCGC AGCGTTACGC AGCGATGCCC ATCAACTATA CCAGGCGTTG
TGGGCATCGG ACGGCTCAGG CGCCGTCATC TTTGAGAGCA GCGATCCGTC CGCAGTTCGC
CTGCTCTGGC TGCCCACCGA TGGCTCGCCC GGCGTTGAGT TGTACAAGGG AACAGACCTG
TACTCCGTGC GTTGGGGCAA GCGTAGCCAG AAGCATTAG
 
Protein sequence
MFCIQLRAVC LLLALLLMMS ACAGSPATVP TTIAPTVAPA AAPTAATPPT AAPAPTAPPA 
ATEMPTSANA VLPAPLYVLD TGQIVRIERD GVTRKQITNE APPAPDALAI VQFDVSPVDG
TLVYLVQGIG TPPVLVRTDA DGGNRATLLD SMPVAAPVIA PNGATMALRV FEDYERPGTY
TPGLYLMPVA GGEPRLILAD KPATDPSIEG GDGRGFEAVA WSPDGTKLLV HAFSLSVELC
ELAIVDVASG GIVYLAAPEP NLVAACTAAA WTLDSKAVYF SVANPGKGFN EPGIWRGDAM
SGAATSVPIE PSDALLHMPF FAIDRLYAFV SSAPGENPVS SPAADPAELM ALSYTMSSVP
LTGGAFAALR SDAHQLYQAL WASDGSGAVI FESSDPSAVR LLWLPTDGSP GVELYKGTDL
YSVRWGKRSQ KH