Gene Rcas_3809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3809 
Symbol 
ID5541311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4979268 
End bp4980485 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content62% 
IMG OID640895919 
Producthypothetical protein 
Protein accessionYP_001433866 
Protein GI156743737 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCATC TATCGCTTCG CCCACAACCA CTCGGTATCT TTCCCGCACC GACGGGGTAC 
CTGGTGATTC CGCCGGTTGC CGGCGCCGAA GAGGTCTGCG CTGCGCTGCT TGCAGGGCAC
ACGCCGGAAC ATATGCCCGA TGCGCTTCGC TTTTACACGC TGGCGCTGGT GGATGATCGT
GAAAGCGCCT GGCGCGCGTT GGCGTATGAT TCCTCACCGG AAGCGCACTA CAATCGTTTT
GTGCTGCACA GTGACCCGGA TATCTATCCA TATTTGCGTA AGCAGTTACG CGGCGATCTG
GCTGCCCTGC TCGACTTCGT GGCGTATATG GTCGGCTTAA GAGATGCGCC GCCTGATGCA
GACGCAGTAT GTGGCGAAAT TGCCGCGTGC ATCCTGCCGG CGCATGCTGC GGATGCGCTT
GCCCGCCAGC AGTACGATGC CGCTATCGCC GCGCTCCAAC GCGCAGTCGA AGAAGTTCGG
CACATTTCGC CATTGTTTGC TGCGCAGCTG CTGGATCGTC TGGCGACGAT CCACGCCGGT
ATCAGCCAAT CGGCGGCGGC GCTTCAGGCA TTGCGCGATG CCGTGAAACT GGCCGGCGGG
GGGCGTCGCC TCGACCTGCG CGCGTATCTG GCGTTGCGGT TGGGGATGTT GTGCCAGGAT
CTCGCTCATG GGCAGAGGAA CCTGCTGATT GAGGCGAACA CATGGTTCGA GGAGGCGTTG
CGTTGCTGCT CGATTGAGAG CGACCCCGAC CTCTACGCGC TGGCGCATTA CCGGCTGGCG
CTGACGATCC TGGCGCTTGC GCCTGCGGGC AATGGCGATC AGATATTGCG CGAACGAGCC
ATTCAGTCGT TGCGGGAGTC GCTCCGGGTC TACACCTGCG ATACGCACTA CGAGCAGTGG
CTCAATGCAC AGGTTACGCT TGCCAATGCC TTGCGGATTT CGTTTGTTGC ATCCCCTGCC
AATCATCTGA TCGAAGCGGT GCGCCTGTAC GACGAGGCGC TGGCAAGCCG CGATCAGGAG
TGTGATCCGA TCTGGTACGG ACGTCTGCTG GCGAACCAGG GGAATGCGCT GTTCCATCTT
GGCGATTTTG CCCGCGCCCG TGACCGTTTG ATCCGCGCCC GCGCGATCTT CCTTGCTCAC
CGTGACTATG GCGCTGCGGC GTTGCTCGAC GAGGCGCTGG TCGAAATTGA GTGCCGGGGG
TTAGGGGTAC GGGGCTAG
 
Protein sequence
MAHLSLRPQP LGIFPAPTGY LVIPPVAGAE EVCAALLAGH TPEHMPDALR FYTLALVDDR 
ESAWRALAYD SSPEAHYNRF VLHSDPDIYP YLRKQLRGDL AALLDFVAYM VGLRDAPPDA
DAVCGEIAAC ILPAHAADAL ARQQYDAAIA ALQRAVEEVR HISPLFAAQL LDRLATIHAG
ISQSAAALQA LRDAVKLAGG GRRLDLRAYL ALRLGMLCQD LAHGQRNLLI EANTWFEEAL
RCCSIESDPD LYALAHYRLA LTILALAPAG NGDQILRERA IQSLRESLRV YTCDTHYEQW
LNAQVTLANA LRISFVASPA NHLIEAVRLY DEALASRDQE CDPIWYGRLL ANQGNALFHL
GDFARARDRL IRARAIFLAH RDYGAAALLD EALVEIECRG LGVRG