Gene Rcas_3649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3649 
Symbol 
ID5541151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4776810 
End bp4778912 
Gene Length2103 bp 
Protein Length700 aa 
Translation table11 
GC content61% 
IMG OID640895769 
Productfibronectin type III domain-containing protein 
Protein accessionYP_001433716 
Protein GI156743587 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.483287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.101583 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAAC CACTCTCCCG ACCTGGGCGC ATCGCGCTTG CCCTTGCGCT CATTGTCAAC 
GCCACGTTCG TGTTGCTTGC GTTACCATCA GTCGCTCTCG CTTCCGGTTC CATCGTGTAT
GTCGCTCCTT CGGGAGCCGA TACGCCAACG TGTGGCAGCG CAAGCCAGCC CTGCGCCACG
CTGGAAGCCG GTCTTGAACG GATCGTTCAT CCTGCGACCG GTCGATATGC CGGTGAAATC
CGTCTGGCTG CCGGAACATA CACCTACCCA TCGAATCGAG CGGTCGCCTT TATCTGGCGC
AGAGGCGTCG TCACCATACG CGGCGGGTAT TCGACCAGCG ATTGGAACAC CAGCAATCCC
ACGGCGAATG TCACAATCCT CGATGGTGAG AACGTCCGGC GTGGCATCTG GATGACGAAC
AGCACGTTCG ACGCGGCGCA GTGCGTGGTG ACCATCGAGG GGTTGACCGT GACGCGCGGG
CGTGCTCGCA GCGATGATCC GTCGGGCGGC GGGATTCTGA ACGATAGTTG CACGCTGACA
TTGCGCAATG TGACGATTAC GAACAGCGTG GCGCGCGGAC AGGACAACAC GAACGCCAAT
CCTGCAACCT CTCCGGGAAG CGGCGGCGGT CTGTCCGTGA GAGGGGTTCC CGACAACCCG
GCAGTCGCAA CCCTCGAAAA CGTTGTGATC AGCAGCAACC AGGCAATTGG CGGGAACGAT
AGCGCGTCGG CGCCGCGTGG CGGCCTTGCA TCCGGCGGCG GTCTCTTCGC CATCAATGCG
CGTGTGCGTG CTGTTCATCT GCGGGTCGAG AACAACATCG CGCGCGCGGG TGATGCGCCA
GGGTCTGCTG GCGTCACCGG CGTCATCCAG TTTGCCGATG GGCTTGGCGG CGGCATCTTT
TTTGCATTGA TGCCGTCGTT CGAACTGGAT CGTGCGCAGG TGCTCAACAA TCGGGCAGAA
GGCGGCGACG CCGGCAATCA GGGTGGTCTG GGGCTTGGCG GCGGCATCAA GATCGAACTG
AGCGCCGGCA GCATAACGAA CAGCATCATT CAGGGCAATC GCGCGGAAGG CGGCGCCGGC
GCGAACGGCG GCGCAGGGCA TGGCGGCGGC ATCTTCTCCA CCGGCTCAAC AGTGACCATA
GAGAATGTCC ATGTTCTCCA GAACAGCGTC CAGGGCGTCA ATGGATCGAC CGTCGGCGGC
AACGGCGGCG GCGGCGGCAT CTATTTGATC ACCACACCCG GCGCCGGCAG CACCGTTACT
GCCAGTAACA TTGTGATCGC CGGCAACAGT GCGCAGGCGG GCAATGGCGC AGCGCCGACC
GGTGGCGGTC TCGACTGTGT TGACACACAG TTGACGCTGC GCCACGCAAC GATTGCCAAC
AATGAGTTGC AGGGGTCTGT CGCCGGTCTT GGTCCGGCGA TCCGGCTGGT CGGCACATCG
TCGTCCGCCG GTTGCGGCGG TGAGATCAGC AACAGTATCA TCGCCAACCA TACCGGTTCG
CCGATCTATG CCAGCAGTTC GATCAAGCCG TTTACCGTCG CTCGTGTGCT GTTTTTCAAC
AATGGCAGCC CGAATATAAC GGTAGCAGGC AGTGCGCCGA TGACCGAGCA GAACTCGTTG
AGCGGCAACC CGCAGTTCGT CTCGCCAGGC GCGCCGAATT TCGATTACCA CATTCAAGCC
GGTTCCGCCG CAAGGGATCA GGCGCTGAAT AGCACGACCG CCGGCGACAT TGATGGACAG
GCGCGCCCAT TTGGGGCTGC CAGCGATGTC GGCGCCGATG AATATTCGAC CGAAATTCCG
CTCGCGTTTT CGCGCATTCC GCGCGGGGTG ACCCTTTCCT GGCGCACCCC GCCGATTCTT
TTCATCACCC AGTATCGGAT TGAATACACG AAAAGCGTCG GCGCGAACGA CGCTCTTGAA
GGTCCGTCGC CGATTCCACA GCCTGTCTCA ACCACCACGC TGACGCTGAG CGGTTTGACA
CGTGGCGCTA CCTACACGAT TGTCGTCGTC GGGTTGAATG GCGGGAGTGA AGTTGGGCGC
TCGCAGGAAG TGACACTCAC CATCTGGCAG CACGAAGTGT TTTTGCCGCT TGTGGTAAGA
TAG
 
Protein sequence
MDQPLSRPGR IALALALIVN ATFVLLALPS VALASGSIVY VAPSGADTPT CGSASQPCAT 
LEAGLERIVH PATGRYAGEI RLAAGTYTYP SNRAVAFIWR RGVVTIRGGY STSDWNTSNP
TANVTILDGE NVRRGIWMTN STFDAAQCVV TIEGLTVTRG RARSDDPSGG GILNDSCTLT
LRNVTITNSV ARGQDNTNAN PATSPGSGGG LSVRGVPDNP AVATLENVVI SSNQAIGGND
SASAPRGGLA SGGGLFAINA RVRAVHLRVE NNIARAGDAP GSAGVTGVIQ FADGLGGGIF
FALMPSFELD RAQVLNNRAE GGDAGNQGGL GLGGGIKIEL SAGSITNSII QGNRAEGGAG
ANGGAGHGGG IFSTGSTVTI ENVHVLQNSV QGVNGSTVGG NGGGGGIYLI TTPGAGSTVT
ASNIVIAGNS AQAGNGAAPT GGGLDCVDTQ LTLRHATIAN NELQGSVAGL GPAIRLVGTS
SSAGCGGEIS NSIIANHTGS PIYASSSIKP FTVARVLFFN NGSPNITVAG SAPMTEQNSL
SGNPQFVSPG APNFDYHIQA GSAARDQALN STTAGDIDGQ ARPFGAASDV GADEYSTEIP
LAFSRIPRGV TLSWRTPPIL FITQYRIEYT KSVGANDALE GPSPIPQPVS TTTLTLSGLT
RGATYTIVVV GLNGGSEVGR SQEVTLTIWQ HEVFLPLVVR