Gene Rcas_3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3801 
Symbol 
ID5541303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4972017 
End bp4973294 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content64% 
IMG OID640895911 
Producttetratricopeptide TPR_4 
Protein accessionYP_001433858 
Protein GI156743729 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATGA CGCACCGACT GCTCCATCCG CAGCCGCTCG GCATCTTCCC GCTTCCTGCC 
GGGTATCTGG TCATACCGGC AGTGGACGGC GGCGCTGACC TGTGCGCATC ACTGTTGACC
GGACGTGTGC CGGATGCCAT GCCAGACCCA CTGCGCTTTT ACGCGCAGGC GCTCGCGGGC
GACGCGGATG GAGCATGGCA GGCGCTGGAG CGCGATCCAT CCCCCGAAGC GTGTTACAAC
CGCTTCGTGC TGCGTAGCGA ACCGTCAGAT TACGCTCGAT TGCGCTCTGA ACTCGATGGT
GAACTTCACG CACTTCTCGA CCTTGTTGCG TATACCACCG GCATGATCGA CCGACCGCCG
GAGCCGGTAG GGTGTAGCGG CGAAGTGGCG GCGTGCATTC TTGCTGCGCA CGCCGCACAT
GCGCTGGAGA CCGACCAACG CGAGGTCGCC TGCGCGATGT TGCAGCAGGC TGTCGCCGAA
GTGCGCGACA TATCGCCGCT GTTTGCTGCG CAATTGCTCG GCGATCTGGC AGAGATGCAG
GTGCTCGATG ATGATGCCGA TGCTGCAATC CGGTCGCTCC ACGACGCGCT GGATCTGATC
GGGTTGTACA GCCTGCCCGA TGTGCGCGCG CATCTGGCGT TGCGGCTGGG TATGCTCTAC
CAGGAACGCG CTGAAGGACG GCGCGCCATG CTCCTCGAGG CGAGCAAATG CTACCAGGAA
GCGCTACGCT TCTACACCCG CGAGACAGCG CCCGAACTCT ACGCGCTGGC GCAGAATAAC
CTGGCGCTGG TCTACCTGGC AATGCCCCTG ACCGAAGCCA GCGATCAGTT GCGGAAAGGG
ATTGCAGTGC AGGCGCTGCG CGAAGCGCTG ACGATCTACA CCCGTGACAC ACATCCCGAT
CTCTGGCGCC GCACTCAACT CAACCTGGCG AATGCGCTCC AATACCTGCC ATCGGCGAAC
CTGCGCGACC ATTTGATCGA GGCGGTTGGC CTGTACGACG ATCTGCTGGC GACCCGCGAC
CTGCGCCACG ATCCGGCAGG ATATGCGCGG GTTCTCGCCA ATCAGGGGAA TGCGCTGGCG
CATTTGGGAG ACTTTGCGCG TGCGCGTCAG CAGTTGAGCG AAGCGCGGCG CCTGTTTGCC
GTGTGCCAGG ATGCCGATGC CATCGCTGGC GTCGATGAGG TTCTGGAAGA GATCGCGCGG
CAGGAAGCAG GATCACGCAA GGGCGAAGCG ACAGCCCCGG CGCTGCGGCG CGCGCCAGCA
GGCGAGAATG GTTATTGA
 
Protein sequence
MSMTHRLLHP QPLGIFPLPA GYLVIPAVDG GADLCASLLT GRVPDAMPDP LRFYAQALAG 
DADGAWQALE RDPSPEACYN RFVLRSEPSD YARLRSELDG ELHALLDLVA YTTGMIDRPP
EPVGCSGEVA ACILAAHAAH ALETDQREVA CAMLQQAVAE VRDISPLFAA QLLGDLAEMQ
VLDDDADAAI RSLHDALDLI GLYSLPDVRA HLALRLGMLY QERAEGRRAM LLEASKCYQE
ALRFYTRETA PELYALAQNN LALVYLAMPL TEASDQLRKG IAVQALREAL TIYTRDTHPD
LWRRTQLNLA NALQYLPSAN LRDHLIEAVG LYDDLLATRD LRHDPAGYAR VLANQGNALA
HLGDFARARQ QLSEARRLFA VCQDADAIAG VDEVLEEIAR QEAGSRKGEA TAPALRRAPA
GENGY