Gene Rcas_0751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0751 
Symbol 
ID5538217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp981311 
End bp982459 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content58% 
IMG OID640892907 
Producthypothetical protein 
Protein accessionYP_001430890 
Protein GI156740761 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00031079 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.110422 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCA ATCGTGAAGT GGGCGGGCGA TTGCCTTATA GTGAGCCGCT TTTGCATACG 
CTTCTCCGTC TGCTCATTGG CGGAGGCGTT CTCTTGTGTG GCTTGCTTCT GGCGCAGCGC
CTGCCGAACC CTGATCTATT CGGCGATTAT GTCGCTGCAT GCGCCTGGTG GCAACGTCTT
CCCGAACACC TCTGGCTGGC GGGTCTAAGC AGGTGCGATG CAAACGTGGA TTACAATGCG
TTTGGTCCTG GTGCGCATCC TCCCTTCTCA ACGGTCTTTT TCCTTCCGCT TGGCTTCCTC
GCCTGGACCG ATGCTCGCTT AGCATGGCTT ATCATCAGTG GCGCGTGTCT GATAGGTGTC
TGGCACTACT ACCGCGTTCC TGTCAGTGTC TGCGCTGCAA CGGCGCTTTT TGGCGTCTTC
GGGTTGTATC GCGGAACGAT GGAGCCTTTT CTTTTCGCGC TGATGATGGT CGCGCTGTCA
CAGGAAGAAG AGCGTCCGCT CTTCTCTGCC GCGCTGATCG GTCTGGCTGC GGCGATCAAG
GTCTATCCGG TCCTGATGCT GGCGGCGCTG GTCATTGCGC GTCGTCTCAA TGCCTTGATC
GCAGGCATTG TTACCGGCGG ACTTGCGACG GCCGCCGGGG ATTTGGTGCT GGGAATCGGG
AAAACCGGCG CCTGGATGGG GCATATGACT CCTAATGCCC TGGCATGGCG GATTAATCCG
GACAATCTTT CGCTGGTCCG CATTGCAGGG GACTTCGTTC CGCAACTCTC GCCGTTGGTG
GTGGCAGTCG CTCTCTTTGG TGCGGCGGTG GCGCTGCTCA TCAACGCACC GCATGGACAG
GTGCGGATGC ACACTCTCGT ACCGACCACT CTGCTTGTGA CGCCATTGGT ATGGAGCCAC
TATATTGTTC ATACAGGTCT GCTTCAGTTG ACGCGCCTTG AGCAGGTGTT ACTATTCGCG
GGCAGTGGAT TGATCTTTTT GGGTATACTG GGCATCTTCC CGTTCCAGAG CGCTGCCATT
GCATACGGAC CGGTGCTCGC AGCACTGGTG TTGATCTGGC ATCGCGCATG GCGATCTGGC
ACCGAACTTT CTGTTCGGAA GAAACTGTCA GGCGCCGCCC CTTCCGAATC CTCCATACCC
AACTGTTGA
 
Protein sequence
MKRNREVGGR LPYSEPLLHT LLRLLIGGGV LLCGLLLAQR LPNPDLFGDY VAACAWWQRL 
PEHLWLAGLS RCDANVDYNA FGPGAHPPFS TVFFLPLGFL AWTDARLAWL IISGACLIGV
WHYYRVPVSV CAATALFGVF GLYRGTMEPF LFALMMVALS QEEERPLFSA ALIGLAAAIK
VYPVLMLAAL VIARRLNALI AGIVTGGLAT AAGDLVLGIG KTGAWMGHMT PNALAWRINP
DNLSLVRIAG DFVPQLSPLV VAVALFGAAV ALLINAPHGQ VRMHTLVPTT LLVTPLVWSH
YIVHTGLLQL TRLEQVLLFA GSGLIFLGIL GIFPFQSAAI AYGPVLAALV LIWHRAWRSG
TELSVRKKLS GAAPSESSIP NC