Gene Rcas_2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2072 
Symbol 
ID5539552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2657757 
End bp2658893 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content62% 
IMG OID640894207 
Producthypothetical protein 
Protein accessionYP_001432176 
Protein GI156742047 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTCA TACCAACGGA AACCATTGCG GCAATCCACG ATACGCCGCA CCGACTGGTG 
TTCGAGTTCG CCGGCGCCGG CAGCCTGGCG CTCTATTGGC TGCACAGCGT GCCTGGCTCA
TCGCGCACTG TGCTGGAAGC GACCGACCGG TACGCAGCGA CATCGCTCAC CGACCTGATC
GGCAAGACGC CGGAAAAGTT TGTTTCTGCC GACACGGCGC GCATCATGGC TGAGATGGCG
TACCGCCGCG CCATGCGCCT GACCGACGGC GCCGCTGCGT GCCTCGGAGT CGCCTGCACC
GCCGCGATTG CCACCGATCG CGCCAAACGC GGCGCGCACG GCTGTTCTAT TGCCGTGTAC
GACGGCACAA CGATGCGCGC GTTCAACCTG ACGCTCGCCA AAGGCGCGCG CGACCGCGCC
GGCGAGGAAC AGGTGATCAG CCTGCTGATT ATACGCGCAA TCGCCAGCGC TTGTGGCGTC
GCTGCGCCCG ATCTTGCGCT GGAACCTCCC GAAACGCTGG AGGTGGATGA GGAGACGCGA
CCCGATCCGC TGACGCTTCT TGTGCAGGGG GATGTCGAAG ACGTTTTTAT CGACATCGAT
GGGCACGCAC ATCTGAAAGG GACACCGCCG GTCGCACTGC TGTCCGGTTC GTTCAACCCG
CTCCACGCCG GGCACGAACA ACTGGCACAA GCAGCCGCAG CCTTCCTGCG CGTACCGGTT
GTTTTTGAGC TCCCCATTCT GAACGCCGAC AAGCCGCCAC TCGGATATGC CGAACTGGAA
CGCCGCCTGG AGCAGTTTCG CGGACGTTAC CCCGTCGTGC TCAGTCGCGC ACCGCTCTTT
GTGCAAAAAG CGAACCTGTT TCCAGGATGC ACCTTCGTCA TCGGATACGA TACCGCAATT
CGAATCATCG ATCCGCGCTA CTACGATGGC GAAGCCGGAC GCAACGCCGC CTTCGCCGCT
ATCGCCGCCC ATGGATGCAC ATTCCTGGTC GCCGGGCGTA TCAAGGATGG CGTCTTCCGT
ACCCTGGCAG ATATCGACCT GCCGGCTTCA TTGCGTCCAC TCTTCCGTGA ACTGCCTGAG
CGCATATTCC GCGTCGATCT CTCCTCGAGC GCCATCCGCA ACGCTTATGG CACATAA
 
Protein sequence
MNLIPTETIA AIHDTPHRLV FEFAGAGSLA LYWLHSVPGS SRTVLEATDR YAATSLTDLI 
GKTPEKFVSA DTARIMAEMA YRRAMRLTDG AAACLGVACT AAIATDRAKR GAHGCSIAVY
DGTTMRAFNL TLAKGARDRA GEEQVISLLI IRAIASACGV AAPDLALEPP ETLEVDEETR
PDPLTLLVQG DVEDVFIDID GHAHLKGTPP VALLSGSFNP LHAGHEQLAQ AAAAFLRVPV
VFELPILNAD KPPLGYAELE RRLEQFRGRY PVVLSRAPLF VQKANLFPGC TFVIGYDTAI
RIIDPRYYDG EAGRNAAFAA IAAHGCTFLV AGRIKDGVFR TLADIDLPAS LRPLFRELPE
RIFRVDLSSS AIRNAYGT