Gene Rcas_3446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3446 
Symbol 
ID5540945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4501580 
End bp4503349 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content59% 
IMG OID640895564 
ProductSCP-like extracellular 
Protein accessionYP_001433514 
Protein GI156743385 
COG category[S] Function unknown 
COG ID[COG2340] Uncharacterized protein with SCP/PR1 domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.118078 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAACT GGCAGAGATT GGCGCTTATA AGCATTGCGC TTTTCGGATG CTTTGTTCCG 
GTGCAGAGCG CGCCTCCTAC TCCGGCGCAA ACCCCGGATG TGCGGCATAA CGAAGCGCAA
ACAGTCTATC TCAGCAACCT GGCGCGACGC GCCAATGGCA TCCCTCCGTT GCGCTGGAAC
CGCCAACTCA CTGCGGCTGC ACGCTGGTTC GCGTGGGATT CGACCGAGAA CCGTCCGGGA
TGGTACTGTG GTCACCAGGA CACCAACGGA AACTGGCCCG ACTATCGTGC GCGCGCTTTT
GGATACCTGG CCCTCGCAGG CGCGGAAAAT GCCTTCTGCG GATATGTCAC CCCCGCAGGA
GCGGTCCAGG GATGGATGAA CAGTCCGGGT CATCGCGCGA ATCTGCTCGA TCCCGTTTCT
CGCGAGATTG GATTGGGATA CTACCGGCGT TCCGGCGATG GACGAGGTTA CATTGCACAA
ATGTTTGGCG CCGATCCATC CTACGCGCCG GTCATTATTG AGAACGAAGC CCTCTCGACA
CCCCTTGCCC GGGTCAATCT CTATATCTAC AATCGCGCGG AGCAGAACGG CTTTGCGGGG
CTGGGTCCGG CGACGCAGAT GATGATCAGC GCTGATCCCT GCTTTGCCGA CGGCGTCTGG
ATGCCATTTT CCAACGAGAC AACCTGGAAT CTGGAAGATG GACAGGGCTG GCGAACCGTC
TACGTTAAAA CCCGCGATCC GCTTAACCGC ACCGCCACGG TGAGTGACAC TATCTACCTG
GGGGGAACGG CGCCGCTTCA GGAACTCGAT GACGAATACC TGAGCACAAC GCAGCCTTCT
GTAACACTGT ATCGTCTGGA TCAGAGTGGA TGGCCAATGG TGCAATTGAG CCTGGGATGG
ATTGCTGATG ATGCCTACCC GACGTTTAGC AGGTTGTGGG GGAACGGCGA GCGCGTCACT
GACCCGGACG CCTGGGGCGG CACTGCATAC CGACTTGCAA CAAGCAGCGG CGAGTCATCT
GCATGGGTAT GGGACACTTC CTTTATCAAG GATACGCCGC TCGTCGCCTA TTTCCGCATT
AAGGTTAGCA GCAATGAATC AAGCAGCGAA GTAGCGCAAC TGACGGTCAT AGGCGGCGGG
ACGGAATACG GTCCGCTGCG GTTGCGCGGC GTTGAATTTT CCGCGCCCAA CCGGTATCAG
GAATTTGCGC TGCCATTTAC GTTTCACTCA AATCCCAACG AACCTTTCCT GATCTTTCAA
ATCCGGCGCA GCGGCGCTGC TGATGTGTTT GTCGATGCGA TCACCATCTT CAGCGCGTCT
CAACCGGCGA CCTCTCCGCT CATCTGGCAG GTTCCCGGGG GAAACTACCG CGGGCAGGGG
GTGTGGGCGC GTTTCACGGA TGGCGCGCGT TTCTCCGAGA TGACGGAAGC GGCGACGATG
CCCGCCCGTC TGAGTGTTAC ACCGGCGGGT CTGCGGTTCC TGGCTCAGCG CAGCGGCGCT
CCACCGTTGT CGGCACACCT GCGTGTTGCT GCCACCTGCG CACCTCCCGG CTGGCAGGCG
ACACAGGATG TTCCCTGGCT CCAGACCGAA ATTGTCGGCA GCGCAGTGCG AGTCAGGGCC
AACCAATCTG GATTGAACAA CGGCGTCTAC ACCGGCAGTA TCACCATCTC AGCGCCAGAT
GCTGGCGTCG CCCCAATTGT CGTTCCGGTC GAACTGATCG TTGTCGATCA ACTGTTTTCT
GTGCATCTGC CGCTCATCAA CAAAAATTAA
 
Protein sequence
MRNWQRLALI SIALFGCFVP VQSAPPTPAQ TPDVRHNEAQ TVYLSNLARR ANGIPPLRWN 
RQLTAAARWF AWDSTENRPG WYCGHQDTNG NWPDYRARAF GYLALAGAEN AFCGYVTPAG
AVQGWMNSPG HRANLLDPVS REIGLGYYRR SGDGRGYIAQ MFGADPSYAP VIIENEALST
PLARVNLYIY NRAEQNGFAG LGPATQMMIS ADPCFADGVW MPFSNETTWN LEDGQGWRTV
YVKTRDPLNR TATVSDTIYL GGTAPLQELD DEYLSTTQPS VTLYRLDQSG WPMVQLSLGW
IADDAYPTFS RLWGNGERVT DPDAWGGTAY RLATSSGESS AWVWDTSFIK DTPLVAYFRI
KVSSNESSSE VAQLTVIGGG TEYGPLRLRG VEFSAPNRYQ EFALPFTFHS NPNEPFLIFQ
IRRSGAADVF VDAITIFSAS QPATSPLIWQ VPGGNYRGQG VWARFTDGAR FSEMTEAATM
PARLSVTPAG LRFLAQRSGA PPLSAHLRVA ATCAPPGWQA TQDVPWLQTE IVGSAVRVRA
NQSGLNNGVY TGSITISAPD AGVAPIVVPV ELIVVDQLFS VHLPLINKN