Gene Rcas_4307 details

Gene Information

Locus tag: Rcas_4307
Symbol:
ID: 5541818
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5556845
End bp: 5558251
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 62%
IMG OID: 640896413
Product: type I phosphodiesterase/nucleotide pyrophosphatase
Protein accession: YP_001434351
Protein GI: 156744222
COG category: [S] Function unknown
COG ID: [COG3379] Uncharacterized conserved protein
TIGRFAM ID:
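
The start, end, gene length, and protein length fields in this record are mutually consistent, which a little arithmetic can confirm. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are invented for this example.

```python
# Values copied from the record above.
start_bp = 5556845
end_bp = 5558251

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends in a stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"gene length: {gene_length} bp")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1407 bp) and Protein Length (468 aa) fields.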


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTATC CTCGTCTTCT CATTATCGGC CTCGATTGCG CCGAACCATC GCTGGCGTTC 
GATCAATGGC GCGCCGATCT GCCCACCCTC AATCGCTTGA TGGCAGAGGG AGTCTACGGC
GAGTTGGAGA GTTGCATTCC GGCAATCACC GTGCCTGCCT GGAGTTGCAT GATGAGCGGG
CGCGATCCGG GGGAACTTGG CGTTTACGGC TTTCGCAACC GCGCCGATCG TTCCTATGGT
CGTATGGTCG TCGCCGATAG TCGTGCCATT CGTTTTCCAC GTTTGTGGGA CATCCTTGGC
AATGCGGGAT GGCGCGTCGC GGTGATTGGC GTGCCGGGGA CGTACCCGCC ACCGCCGGTC
AATGGCGCGC TCATCTCGTG TTTCCTCGCA CCCTCCACGG ACGCAGCGTA TACCTTTCCG
CCGACACTCG CCGGGCGCGT TGCCGCCTGG ACCGCAGCCG CGACGCCGGG GCGTCCGTAT
CTGCTCGATG TGCCGGATTT CCGTTCCGAC GACAAACAGC GTATCGCGCG CGACATCTAT
GCCATGTGCG ATCAGCGCTT CGCAGTGGCA ACGGCGCTGC TGGAAGAAGA GCATCCCGAC
TTTCTGATGC TGGTGGATAT GGGCGTGGAT CGCATCCACC ACGCGCTCTG GAAGCATATG
GACCCGCGTC ATCCGTTGTT TGTTCCCGAC TCGCCTTTCG CCGACGCCAT TCGCGCGTAC
TATCGTCACG TGGATACGCA GATCGCCGGT CTGCTGACGC GCTGCGGACC CGACACGGCA
GTCCTGATCG TGTCGGACCA CGGCGCGCGC CCGTTGATGG GAGGCGTGCG GATCAACCAG
TGGCTGATCG AACAGGGTGA TCTGAGCGTC CGGGCAATGC CGGACACCCC GACGAGTCTC
GATCAGGTCG AGGTTGACTG GTCACGCACG CGCGTCTGGG GCGCCGGCGG CTACTACGGG
CGAATTTTTC TCAATGTGCG CGGGCGTGAG CCGCAGGGAG CCATCTCAGC AGCAGAGTAC
GAACGTGTGC GCACCGATCT TGCAGCGCGC CTGGAAGCCA TGCCAGGACC CGACGGATGT
CCGCTGGGCA ATCGTGTTTT CACACCCCGG CAACTCTACC GCGCAGTGCG TGGCATCGCG
CCCGATCTGA TCGTCTACTT CGGCGATCTT GGATGGCGCG CGGTCGGAAC GATCGGCGGC
ACGGGCATCT TCACCCAGGA AAACGACACC GGTCCTGATG ACGCCAATCA TGCGCAGCAC
GGAATGTTCA TCTGGCGCGA CCCGCAACGT CCGGGCGGCG GACGGCGATT CGACAGGGTG
CAGATTTACG ATATACTGCC GACTCTGTTG AGACGGTTCA ACATGCCGAT TCCTGAAGGA
CTACGCGGCA CGGCGCTGAA TCTATAA
 
Protein sequence
MTYPRLLIIG LDCAEPSLAF DQWRADLPTL NRLMAEGVYG ELESCIPAIT VPAWSCMMSG 
RDPGELGVYG FRNRADRSYG RMVVADSRAI RFPRLWDILG NAGWRVAVIG VPGTYPPPPV
NGALISCFLA PSTDAAYTFP PTLAGRVAAW TAAATPGRPY LLDVPDFRSD DKQRIARDIY
AMCDQRFAVA TALLEEEHPD FLMLVDMGVD RIHHALWKHM DPRHPLFVPD SPFADAIRAY
YRHVDTQIAG LLTRCGPDTA VLIVSDHGAR PLMGGVRINQ WLIEQGDLSV RAMPDTPTSL
DQVEVDWSRT RVWGAGGYYG RIFLNVRGRE PQGAISAAEY ERVRTDLAAR LEAMPGPDGC
PLGNRVFTPR QLYRAVRGIA PDLIVYFGDL GWRAVGTIGG TGIFTQENDT GPDDANHAQH
GMFIWRDPQR PGGGRRFDRV QIYDILPTLL RRFNMPIPEG LRGTALNL
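
The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before use. The following is a minimal sketch that joins the blocks and translates the first line of the gene sequence. The helper names `clean_blocks` and `translate` are hypothetical. The codon assignments are the standard ones, which translation table 11 is understood to share (it differs mainly in the permitted start codons).

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed to apply to table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_blocks(text):
    """Remove the spaces and line breaks used for 10-character blocking."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons from position 0, stopping at a stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and the matching start of the protein.
first_dna_line = "ATGACCTATC CTCGTCTTCT CATTATCGGC CTCGATTGCG CCGAACCATC GCTGGCGTTC"
first_protein_blocks = "MTYPRLLIIG LDCAEPSLAF"

dna = clean_blocks(first_dna_line)
peptide = translate(dna)
print(peptide)
```

The 60 bases on the first line translate to the first 20 residues of the listed protein sequence. The same two helpers can be applied to the full gene sequence once all of its lines are joined.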