Gene Rcas_1972 details

Gene Information

Locus tag: Rcas_1972
Symbol:
ID: 5539450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2523207
End bp: 2525750
Gene Length: 2544 bp
Protein Length: 847 aa
Translation table: 11
GC content: 65%
IMG OID: 640894107
Product: von Willebrand factor type A
Protein accession: YP_001432078
Protein GI: 156741949
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1240] Mg-chelatase subunit ChlD
TIGRFAM ID:
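
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; the record states neither, so both are assumptions.

```python
# Values copied from the record above.
start_bp = 2523207
end_bp = 2525750

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints 2544 847
```

Under those assumptions the result agrees with the reported gene length of 2544 bp and protein length of 847 aa.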


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.865773
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCTTC GCAACCCGCA GTTTCTCCTG CTCCTGATGC TCGTGCCGCT GATCATCGGC 
GGTTGGCTGT GGCGGCGTGG TCGCCTGCCG GCTGCGGCGC TCGTGCTGCG CGTGATCATT
GTGATCCTCA TTGTCGGCGC GCTGACCAAC CCGATTGCCC TGCAAGGGCA TATCGACAGC
GCAGGTGCGC CGCTCGTCGT GGTGCTGGTC GATCAATCGG ACAGCCTGAC CGACGAAGGC
AAAGCAGCGT TGCGCGCGCG CGCTGCGGCG CTTGCCGCAA ACAGCAGCAG CCCGGTGCAA
ATTATCACGT TCGGAGCAAC AGCAGCCGCA GAACAGAGCA CCCCCCCCGG CGATCAAACC
GATATCGCCG CAGCGTTGCG CGCCGCGCGC GGATTGATCG GCGACGGCGG CGGGCGCGTG
GCGCTCCTCT CAGACGGACT CCAGACCAGA GGCGATGCGC TGGCTGAGGC AAGAGCGCTC
GCAGGCGCCG GCATCCCGGT CGATACTGAA TATTATCAGG CGCCTGCACG ACCAGAGTTC
TGGATTGCAA CCATCGAAAC GCCGCCGACA TTGCGCGAAG GCGAAGAGTT TACCGCCCGA
ATCGTCATGG CCAGCACCGT CGCTGCCAAC GCGCAACTCG AACTGACCGT CGGCAATGAG
CGCGTGCTGG CGCAGCAGGT GCGCCTGGCG CCAGGTGAGA ACAGTGTCCC CTACACCGGT
CGCGCGGGGC GACCGGGCAT TCTGCGGATG CAGGCCACGC TGACCGGTCA ACCGGATACG
CTGACGCGCA ACAATGTTGC CGGTGCGACG GTGCTGGTGG CGCCGATGCC GCGCATTCTG
CTGGTGGAAG GACCGAACGA CGTCAACAGT GCGCCATTGC GCAGCGCATT GCGTGAGGCT
GGTGTAATGG CGGATGTGGC GGAGGCGGCA TCGCTCCCCG CACAGATCTC GGCGCTCGGA
TTGTATGAAG GGATTGTACT GATCGATGTG CCTGCCGGTG TCTTGAGCCT CGACCAGATG
GCAACGCTGC GTGAGTTCGT GCGCAGCGAG GGGCGCGGGT TGCTGGCAAT CGGCGGGCGA
TCCAGTTTCA CCCTTGGCGC TTATAAGGAC ACGCCACTGG AGGAAACGCT CCCCGTAACC
ATGGTTCCGC CGCCGCGTCC CGAACGCTCC GATACGACAC TGCTCCTGAT CATCGATCAG
TCCGCCAGCA TGGGACCGGA GACCGGCCTT TCCAAATTCA CCATGGCGAA GGAAGCCGCC
ATCATGGCGA CCGAATCGTT GCGCGCCGAG GATCGTATTG GCGTGCTGGC GTTCGATGTA
TCGACACGCT GGGTGGTCGA CTTTCAGCCG GTCGGAACGG GGTTAAGCCT GGCGGATATT
CAGCGACGGA TCAGCACGCT GCCGCTTGGC GGCGGCACCG ACATTTACAA CGCATTGCAA
ACCGGTCTGC CGGAGTTGGC GCGCCAACCG GGGCGGGTGC GTCATGCTGT GCTCCTTACC
GACGGTCGCT CCTTTACCGA TGATCGACAG GCGTATCAGG CGCTGATCGA GGAGGCGCGC
AGCCGGAATA TCACGCTTTC GACCATCGCA ATCGGAACTG ACGCGGATAT CGACCTGCTC
CAGACGCTGG CGCGCTGGGG CGCCGGGCGC TACTACTTCG CTGCCGAACC GGGCGATATT
CCACGCCTGA CGCTGCTCGA AAGCGAAATC GTGCGCACCG AGCCGCAGGT CGAAGGAGAT
TTCCGCGCTG AACAAAAGGC GCCCCACCCG ATGCTGCGCG ATTTTGCCCC GGCACAGATA
CCCGGACTGA AGGGGTATGT CGCCACAACC CTGAAACCCG GCGCCGACCT GGTGCTGCAA
TCGCCCGACG GCGATCCCGT GCTGGCGGTC TGGCAGTATG GATTGGGACG CGCCGCTGCC
TGGACGCCGG GCGCAGAAGC GCCATGGGCT GCCGATTGGT CGAACTGGCC CGAATATGGG
CGCTTTTGGG CGCAGTTGAT CCGCTACACG CTTCCCGAAC CGGATAGTGG ACCGCTCCAG
GTGCGCGTCG TCCGCGACGG TGATAGCGTG CGCATCGTGG CCGATTCGGT CGCGCCGGGA
GGCAGACCGC TCGACCTGGC AGACACTCAG GCAACGATTG TGCTGCCCAA CGGCGCCGCG
CAACTGATTA CGCTGCGTCA GACGGCGCCA GGTCGCTATG AACAGGCGCT CATCCTGCCG
GACGACGGAC CCTACGCAAT TGAAGTGCGT CAGCAAAAAG GGAGCGAGGT GCGCGCAGCG
CAGGCCGGCT ACGTGCAACG CTACTCCGAC GAATACCTGC CCCCCGCCGA TCCGCAGACC
GGTGCGCGGC TGATGAACGA CATCAGCGCA ATCACCGGCG GGAATGCACT GGGCGGAGGC
TCACTCATCG CGCCGGGAGG CGGCGCCATC CGCGCCGAAC GGCTGCCCGA CACCGGCTTC
TGGCCCTGGC TGCTTGGTCT GGCGGCGCTG CTCTGGCCCC TCGAAATCGC CATCCGGCGC
GGATGGGTGC GGTTGCGTCG ATGA
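
The GC content field reports the share of G and C bases in the gene sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted as a string with the display spacing left in. The helper name `gc_fraction` is hypothetical, and only the first 20 bases are used here for brevity.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)

# First 20 bases of the gene sequence above, with the display spacing kept.
fragment = "ATGATCCTTC GCAACCCGCA"
print(gc_fraction(fragment))
```

Applied to the full 2544 bp sequence, the result can be compared with the 65% reported in the record.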
 
Protein sequence
MILRNPQFLL LLMLVPLIIG GWLWRRGRLP AAALVLRVII VILIVGALTN PIALQGHIDS 
AGAPLVVVLV DQSDSLTDEG KAALRARAAA LAANSSSPVQ IITFGATAAA EQSTPPGDQT
DIAAALRAAR GLIGDGGGRV ALLSDGLQTR GDALAEARAL AGAGIPVDTE YYQAPARPEF
WIATIETPPT LREGEEFTAR IVMASTVAAN AQLELTVGNE RVLAQQVRLA PGENSVPYTG
RAGRPGILRM QATLTGQPDT LTRNNVAGAT VLVAPMPRIL LVEGPNDVNS APLRSALREA
GVMADVAEAA SLPAQISALG LYEGIVLIDV PAGVLSLDQM ATLREFVRSE GRGLLAIGGR
SSFTLGAYKD TPLEETLPVT MVPPPRPERS DTTLLLIIDQ SASMGPETGL SKFTMAKEAA
IMATESLRAE DRIGVLAFDV STRWVVDFQP VGTGLSLADI QRRISTLPLG GGTDIYNALQ
TGLPELARQP GRVRHAVLLT DGRSFTDDRQ AYQALIEEAR SRNITLSTIA IGTDADIDLL
QTLARWGAGR YYFAAEPGDI PRLTLLESEI VRTEPQVEGD FRAEQKAPHP MLRDFAPAQI
PGLKGYVATT LKPGADLVLQ SPDGDPVLAV WQYGLGRAAA WTPGAEAPWA ADWSNWPEYG
RFWAQLIRYT LPEPDSGPLQ VRVVRDGDSV RIVADSVAPG GRPLDLADTQ ATIVLPNGAA
QLITLRQTAP GRYEQALILP DDGPYAIEVR QQKGSEVRAA QAGYVQRYSD EYLPPADPQT
GARLMNDISA ITGGNALGGG SLIAPGGGAI RAERLPDTGF WPWLLGLAAL LWPLEIAIRR
GWVRLRR
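
The protein sequence can be reproduced from the gene sequence by translating it codon by codon with translation table 11. The following is a minimal sketch, assuming the gene sequence is given 5' to 3' on the coding strand. The names `CODON_TABLE` and `translate` are hypothetical. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in TCAG order
# (first base varies slowest, third base fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()  # drop the display spacing
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence above.
print(translate("ATGATCCTTC GCAACCCGCA GTTTCTCCTG"))  # prints MILRNPQFLL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence should yield all 847 residues, with the final TGA codon read as the stop.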