Gene Rcas_1307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1307 
Symbol 
ID5538779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1685665 
End bp1686831 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content62% 
IMG OID640893445 
Producthypothetical protein 
Protein accessionYP_001431422 
Protein GI156741293 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.323452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.304058 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTTC TACTCAAGGG AGTGTCGCGT GAAGAGGTGC GAGAACTAAT TGCGCTGAAC 
GAAACACCGT GGGTTTCGTT ATTCCTGGCG CCCCATCCGC CCGGTGAAAC CATCCAGCGC
GCGATTCAGC TCGAAAATCT GCTGCGCCGC AGTGAGGCGG AACTTGAGGC GAGCGGGTAT
GATCGCGACG ATGCCCACGC GATGCTGGCG CCCGTGTGGG AGATGACCGA CGCGCGGACT
CTGGAGGAGT ATCAGGACGC TGGACTGGCG TGCTATGTTG CACCCGGTCA GTTCCACCTC
TATCGCCTGC CGCACCGCGT CAGCGACGCT GTGATCGTCG GGCGGCGCCC TTTCATTAAG
CCGCTGCTTA TGCCCCGTCC GGCAACCGAC TCGTTCTATG TGCTGGCGCT AAGCAAGAGT
CGTGTGCGTT TGCTCCACGC TACACCGTCG GGCATCACCG CCGTGCCGCT TCCCGACGCG
CCTGCCGGTA TCGACGATTT GCCGCAGACC GACCCGACCG GACGCCAGGC GCAGCGCCAT
GTTGCTCCTT CGACGCGCGG CGGCGCCAGC GGTGCGATGT ACCCCGGTCA CGGCGGCAAC
ATCTACGACG AAAAAGCCGA GGTGCAGCGT TATCTTCAGG CAGTGAGCAA TGCGGTCGAA
CGTGCGTTGA GTCGCGCGCG CGATCCGCTC GTGCTGGCAG GCGTCGATTA CATGGTGTCG
ATGTACCGTG CATTGAATGG CTATGCGCAC GTCATCGACA CCCATATCAG CGGTAGTCCT
GACCACGTGA ACGATGAAGC CCTGGGTGAA CGCGGAGCGC ACGTGCTGAT GACGCATCGA
AGCCGTCTGG CAACTGATGA GCGCGATCGC TTCGAGGCGC TGTTGCAATA CAACCCGCCG
CGCGCCAGCA CGAATCTGCG CTCAATCCTG CCGGCTGCAC ACGCCGGTCG TGTGGCACGA
CTCCTCGTTG CCAGCGATCG GCAGATGTGG GGACGCTACA ATCCCGATGA CGAAACGATC
TCGCTCCATG ATGAGCCGCT GCCGGGCGAT GATGACCTGC TGGACATTGC GGCGCAGCAA
ACGCTGCTCC ACGGCGGCGA AGCCGTTGCG GTTCCGGCAA CGGATATTCC CGGCAGTAAC
GGCGTGGCAG CAGTTTTTCG CTACTGA
 
Protein sequence
MKVLLKGVSR EEVRELIALN ETPWVSLFLA PHPPGETIQR AIQLENLLRR SEAELEASGY 
DRDDAHAMLA PVWEMTDART LEEYQDAGLA CYVAPGQFHL YRLPHRVSDA VIVGRRPFIK
PLLMPRPATD SFYVLALSKS RVRLLHATPS GITAVPLPDA PAGIDDLPQT DPTGRQAQRH
VAPSTRGGAS GAMYPGHGGN IYDEKAEVQR YLQAVSNAVE RALSRARDPL VLAGVDYMVS
MYRALNGYAH VIDTHISGSP DHVNDEALGE RGAHVLMTHR SRLATDERDR FEALLQYNPP
RASTNLRSIL PAAHAGRVAR LLVASDRQMW GRYNPDDETI SLHDEPLPGD DDLLDIAAQQ
TLLHGGEAVA VPATDIPGSN GVAAVFRY