Gene Rcas_1687 details


Gene Information

Locus tag: Rcas_1687
Symbol:
ID: 5539163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2172628
End bp: 2174382
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 63%
IMG OID: 640893824
Product: PA14 domain-containing protein
Protein accession: YP_001431797
Protein GI: 156741668
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID: [TIGR02532] prepilin-type N-terminal cleavage/methylation domain
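
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2172628
end_bp = 2174382

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1755 584
```

Both results match the Gene Length and Protein Length fields of the record.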


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000321103
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCACGAC GATTTCAACC TGGACAGGCG CTCGTCGAGT TCGTGATCGC CTCGACGCTC 
ATCCTGCTGC TGCTGGCGGC TGCGGTCGAT ATCGGTCTGC TCTTCTTCAA TATGCAGGGG
CTGACCACCG CAGCGCAGGA AGGCGCCACC TACGGCAGCC GCTACCTGGT CGTTCAGCCG
AACGGGACGG TTGACCTCGA CTACACTATG ATCCGCGCGC GCGTGCGCCA GGAAGCCGGC
ACAACCGGCG GCATCAACTT CGTCAACATG TACGACCTGA ACAGCGATGC CATTCCCGAC
GCCGAAGATA CCAACGGCGA CGGCGTGCTC GATCATTTTC AGTACTTCCT CGATCAGAAC
GGTGATGGGC GCGCCATTGG CGACCCGGTC ACCGGACTGA TCCCCGAAGG AACGCAGCCT
GCCGGGTATG TCCGGTTGAT CGACCAGTAC ATCCTGGTGC AGGCGATTGA AGACGTCTTC
CCCTTCAACG GCAATCCGCT GGGACCGGAG GACGCCAACG GCAATCGCGC CGACGACCTG
GTTGCCGATG TCGATACCAC GCCATGCGCA AATCTGGCGG ATCCGAACCG GCAGTGTTAT
GTCTTCGTGA TTGTGAAGTC CGACTACAAC ACGGTCTTTG GCTTCACGCC CGCATTCGGC
GACAAAGTGC CGCTCAGTTC GCGCTTTGTG ATGCCGCTGC GCGCTGGCTT TGTCTCGCCG
GGCGCGCCGA CGAACACGCC GGTGGTGCAG ACCAATACGC CAACACCAAC GCCAACCGAT
ACGCCCACTC CAACTGCAAC GAATACGCCG ACGCGCACGC CAACCAATAC GGCGTCGCCG
ACGACAACCA ATACACCAAC CAACACGCCG ACGCGCACCA ATACGCCGAC GCGCACCAAT
ACGCCGACGA ACACCAATAC GCCGACGAAC ACCAATACGC CGACGCGCAC GCCAACGTTT
ACGCCCACAC CGACGCCATG TGCTGGCGGC ACGGGCAACG GTCTGCGCGG TGACTACTTC
ATCTACACGC CGGGCGGCAG CGTTGGCACG ACCAACTTCT TCCCCGGCGC GCCGGTCGCC
AGTCGCCTCG AGAATATCAA TATGGCGCGG AGCGATACCT CGCCCATCGC CGGCGTCGGC
AACGACTACT TCTCGGTGCG CTGGACGGGG CAGGTCGAGC CGTTGTTCAG CGGCGAGTAC
ACCTTCTACG CTAACACCGA CGATGGCGTG CGCGTGTGGG TGAATGGCGT GCAGATCATC
AATGACTGGC GCACCAAGAA TAGCGAAACC AACGGGAGGA TCACTCTGAC CGCCTGCCAG
CGCGTGAACA TTACGGTCGA GTACTTCGAG TGGACCGGTA GCCAGAACGC AATCCTCTCA
TGGCAGCACG CGAACGTGCC GAAGCAGGTC ATCCCCATCC AACGGCTCTA TGCGAGCGGC
TCACCGCCGG CAACCGCGAC GCGCACGCTG ACGCCGACGC GCACCGATAC GCCGACGCCG
TCGCGCACGC CGACGCGCAC GCTGACACCG ACGATTACCA ATACGCCGAC GCCGTCGCGC
ACGCCGACGC GCACGCTGAC ACCGACGATT ACCAATACGC CGACAAATAC GCCGCCGGCG
ACGAACACGC CAACCCGCAC GCCGACGCTC ACACCGTCGC ACACGCCGAC GCTCACACCG
TCGCGCACGC CGACGAATAC GCCAACCCGC ACGCCGACGC TCACACCGTC GCGCACGCCG
GACACCGGAA CGTAA
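
The gene sequence above is laid out in blocks of 10 bases, so the whitespace has to be stripped before any composition statistic such as GC content is computed. The following is a minimal sketch; the helper name `gc_fraction` is hypothetical and not part of any database tooling, and only the first line of the sequence is used for illustration.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base blocks and line breaks used in the
    listing) is ignored, and the comparison is case-insensitive.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above, for illustration only.
first_line = "ATGGCACGAC GATTTCAACC TGGACAGGCG CTCGTCGAGT TCGTGATCGC CTCGACGCTC"
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full 1755 bp sequence to the same function gives a value that can be compared with the GC content field of the record.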
 
Protein sequence
MARRFQPGQA LVEFVIASTL ILLLLAAAVD IGLLFFNMQG LTTAAQEGAT YGSRYLVVQP 
NGTVDLDYTM IRARVRQEAG TTGGINFVNM YDLNSDAIPD AEDTNGDGVL DHFQYFLDQN
GDGRAIGDPV TGLIPEGTQP AGYVRLIDQY ILVQAIEDVF PFNGNPLGPE DANGNRADDL
VADVDTTPCA NLADPNRQCY VFVIVKSDYN TVFGFTPAFG DKVPLSSRFV MPLRAGFVSP
GAPTNTPVVQ TNTPTPTPTD TPTPTATNTP TRTPTNTASP TTTNTPTNTP TRTNTPTRTN
TPTNTNTPTN TNTPTRTPTF TPTPTPCAGG TGNGLRGDYF IYTPGGSVGT TNFFPGAPVA
SRLENINMAR SDTSPIAGVG NDYFSVRWTG QVEPLFSGEY TFYANTDDGV RVWVNGVQII
NDWRTKNSET NGRITLTACQ RVNITVEYFE WTGSQNAILS WQHANVPKQV IPIQRLYASG
SPPATATRTL TPTRTDTPTP SRTPTRTLTP TITNTPTPSR TPTRTLTPTI TNTPTNTPPA
TNTPTRTPTL TPSHTPTLTP SRTPTNTPTR TPTLTPSRTP DTGT