Gene Rcas_0452 details

Gene Information

Locus tag: Rcas_0452
Symbol:
ID: 5537915
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 570937
End bp: 572958
Gene Length: 2022 bp
Protein Length: 673 aa
Translation table: 11
GC content: 62%
IMG OID: 640892615
Product: hypothetical protein
Protein accession: YP_001430601
Protein GI: 156740472
COG category:
COG ID:
TIGRFAM ID:
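
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are inclusive and 1-based (the record does not state its coordinate convention) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(570937, 572958)
print(length, protein_length(length))  # prints "2022 673"
```

Under these assumptions the result agrees with the Gene Length (2022 bp) and Protein Length (673 aa) fields.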


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTATGA CGAAACGCAC ATCTCGTCCA ATCATCTCCC GCCGTCGCTT CCTTCAGGCG 
TGTGCATGCA CGGTTGCAGC CGGGGCGCTT GGCGCAACGG GGTATGTCGT CGCCAACGCT
CCGCTTCCCC CACCGTATCC TGAATCATCG GTCTTTCAGA CGCCGGTTGC AGGTATACCG
ACGCCTGGCG CGCCTATCCT GTTGGTGACG AATCCAGGTG CGCAGCCGTC GTTTGGCGCG
TACCTTGGCG CTATTCTGCG CGCGGAAGGG TTCGTAGCGT TTCGGATGGC ACGCCTCGAT
GCGATCAATC CGGCGCTGCT GGCGCAGTTT CCGCTCGTGC TGCTGACAGT CGGTCCGTTG
ACCGCAGAAG CGTCCGATCT GTTCCGGGCA TATGTGCTGA ATGGCGGGCA TCTGATTGCA
TTTCGTCCCG ATCCGCGCCT CGCCGACCTC ATGGGTGTGC GCGCGCTCGG AGGTGATGTG
ACCGACGGAA TGCTGGCGGT CGCCGATCAT CTGCTCGCGC AGGGCATTAC CACCCAGGCG
CTTCAGGTTC ACACCCCGAT GGCGCAGTAT GAACTGGCGG GCGCTGAGGC GGTCGCCTGG
ATCGCCCGTC GTGACAGCAG CCGAACGTCC TACCCTGCTG TGACGCTGAT GCGCGCCAAA
AAAGGCATCG CTGCGCTGTG GGCATTCGAC CTGCCCCGTA ACATCGCCCT CATCCGCCAG
GGGAATCCGG CAGCGGCAAA CCAGGAGCGC GACGGGATGG AAGGCGTTCG GACGGTTGAT
CTGTTTGTGG ATTGGATCGA TCTTGATCGT ATTGACATTC CGCAGGCAGA CGAGCAGCAG
CGATTGCTCG CAAATATGAT CCATGCGCTG GCGGGTGAGG CGCCGCTGCC GCGCCTCTGG
CACCTGCCGG CTGGCGCTTC TGCGGTTCTG GTGGCAACCG GTGATGCGCA CGGACTGCTG
GCTTCCCATA TTGCTACGGC GCTGGAACTG GTCAGCCGGT ACGATGGCGC GCTGTCGATC
TACTATGCCC CGCCGCCGAT GAGCAACCGG TCGCGGACGC TGCGCCGGGT TCGTTGGTTG
GCTGAAGAAT TGCCGGTTGC CGGCGCGGTC TTCACCGACG ACGCAGGGTA CCCCACTCCG
AAAGATGTGG CGCGCTGGCG CGAAAGGGGA CACGGGTTCG GATTGCATCC CTACGTTGAA
CAGGGAGTGG GCAAGGGGTA TCACGAGTAC TGGAATACCT TCATCAAACT GGGGTATGGA
CCGGCTGAGC CAACCGTGCG CACCCATCGG GTGCTCTGGT CGGGGTGGGT CGAAACAGCG
CGGGTGCAGG CGCAGTATGG ACTACGCATG AGTCTCGACC ACTACCACAG TGGTCCGCTG
ATGCGTCGCG CAGACGGGCG CTGGGTTCAT GGGTACCTCA CGGGGAGCGG ACTGCCCATG
CCGTTCGTCG ATGAGCAAGG GAATCTGTTG CGAGTCTATC AGCAGCATAC GCACATTGTC
GATGAGCACC TGATGCGGGT GTTCGACACC GGCTACGAGA TGGGAGTGGA TGTCAATGAA
GCCATTGCCA TCGCGTGCCG GCAGATCGAT GCAGCGGTAG AGCAATATCC CTCGGCGCTT
GGATTACAGT GTCATATCGA CCCGTTTGCC TTTGGCGGCG AGAAGGCGGA GGCGGCGAGT
GTGTGGTTCG ACCGCGTGCT CGACCATGCG GCGTCGCGCG GGGTGATGAT TGTGTCGGCG
GAACAATGGC TGGCGTTCAC CGAGATGCGC GATCAGGCGG AGATGCGCAA CCTGATGTGG
AATGAGTCTG AGGGCGTGTT GATGTTCGAA GCGGTTATTA GTGCGGAGTC GCAGCGCGCG
CCGGCGCTTC TGCTGCCCCT GGAACACCGC AGGCGCATAC TGCGCCAGGT GACGATTGAT
AGCGTGCTGG CGAGCGCCGA GCAAAAGCGT GTGGGGGGAG TCGCCTACGG TGCGGTGGCG
CTGGCTGCCG GGAGGCGACA GGTGAGGGCA TATTATAGAT GA
 
Protein sequence
MTMTKRTSRP IISRRRFLQA CACTVAAGAL GATGYVVANA PLPPPYPESS VFQTPVAGIP 
TPGAPILLVT NPGAQPSFGA YLGAILRAEG FVAFRMARLD AINPALLAQF PLVLLTVGPL
TAEASDLFRA YVLNGGHLIA FRPDPRLADL MGVRALGGDV TDGMLAVADH LLAQGITTQA
LQVHTPMAQY ELAGAEAVAW IARRDSSRTS YPAVTLMRAK KGIAALWAFD LPRNIALIRQ
GNPAAANQER DGMEGVRTVD LFVDWIDLDR IDIPQADEQQ RLLANMIHAL AGEAPLPRLW
HLPAGASAVL VATGDAHGLL ASHIATALEL VSRYDGALSI YYAPPPMSNR SRTLRRVRWL
AEELPVAGAV FTDDAGYPTP KDVARWRERG HGFGLHPYVE QGVGKGYHEY WNTFIKLGYG
PAEPTVRTHR VLWSGWVETA RVQAQYGLRM SLDHYHSGPL MRRADGRWVH GYLTGSGLPM
PFVDEQGNLL RVYQQHTHIV DEHLMRVFDT GYEMGVDVNE AIAIACRQID AAVEQYPSAL
GLQCHIDPFA FGGEKAEAAS VWFDRVLDHA ASRGVMIVSA EQWLAFTEMR DQAEMRNLMW
NESEGVLMFE AVISAESQRA PALLLPLEHR RRILRQVTID SVLASAEQKR VGGVAYGAVA
LAAGRRQVRA YYR
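
The sequences above are displayed in blocks of ten characters. The following is a minimal Python sketch of how such a listing might be cleaned up and compared against the protein sequence: it strips the whitespace, computes a GC fraction, and translates codons until the first stop. It uses the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in their permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first three and last five blocks of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the standard code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(listing: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(listing.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGACTATGA CGAAACGCAC ATCTCGTCCA"
last_blocks = "CTGGCTGCCG GGAGGCGACA GGTGAGGGCA TATTATAGAT GA"
print(translate(clean(first_blocks)))  # prints "MTMTKRTSRP"
print(translate(clean(last_blocks)))   # prints "LAAGRRQVRAYYR"
```

Both outputs match the corresponding ends of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content field.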