Gene Rcas_3983 details


Gene Information

Locus tag: Rcas_3983
Symbol:
ID: 5541493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5193882
End bp: 5196200
Gene Length: 2319 bp
Protein Length: 772 aa
Translation table: 11
GC content: 61%
IMG OID: 640896095
Product: aldehyde oxidase and xanthine dehydrogenase molybdopterin binding
Protein accession: YP_001434034
Protein GI: 156743905
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
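
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5193882, 5196200)
print(length)                  # 2319
print(protein_length(length))  # 772
```

Both values agree with the Gene Length and Protein Length fields in the record.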


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0445989
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCTCACA ATGGATCAAC GTTTCGCTAC CTGGGCAAGG GGCATCGCCT GATCGAAGGG 
TTGGAAAAGG TAACCGGCAA TGCGAAGTAC GCTGGCGATC TGAAGATCTC CGGTATGCTG
CACGCATGCC TGGTGTTAAG CCCGTATGCG CATGCGCGCA TTGTTTCGAT CGATGCCAGC
GCTGCGCGGG CAATGCCGGG TGTTGTTGCG GTGCTGACCG CCGATGACCT GCCAACGCGC
GATCGTGCCG TTAATTCGCG CCACAGTGCC GTGCTGGCAA AGGATCGTGT GCTCTGGCGC
GGGCAACCGG TTGTGGCGGT CGTCGGTGAA ACCGAGACAG CGGCGCGTGA TGCCGCCGAT
CGCGTGGTCG TCGAGTATGA GCCGCTGCCG CCTGTCGTAG ATGTGCGCAA AGCCGCCAGT
CCCGATGCGC CGGTCATCTG GACCGAAGGG TTGCCGAAAG AAGGCGCCGA CCTGACCGCA
GCCCACGCTG CGGTCGACAA AGGCGAACAG GAAACAACCG GCGCACCCTC GAATATTCAC
GACGAGGTGC ATTTTGCGCG CGGCGACGTC GAGCGCGGCT TCACAGAAGC CGATGTGATC
ATCGAACGGG TCTACTGCAC CCCGATGGTG CACCAGGGTT ATCTGGAGCC GCATGCGTCG
GTTGCTGAGC CAGACCCATA CCGTGGCGGC GTGACCGTGT ACGCCAGCAC ACAGGGTCAG
TTTAGCGTGC GCGACGAAGT AGCGCGTCTC CTGTCGCTGC CCCGGCATAA GGTGCGCGTC
GTGCCCATGA CGATTGGCGG CGGTTTTGGC GCCAAGTACG GTATTATCGA TCCGCTGGTC
GCCGCTCTTG CTGTGACCGT CAAACGTCCG GTGCGTCTTG TGCTCACCCG CACTGAGGAT
TTTCTTTCAA CAACGCCTTC GCCAGCAGCA ATCGTGGAAC TGAAGGTTGG CGCGCGCGCC
GACGGCGCAC TGACGGCAAT CCAGGCGCGG GTGTTGATGG ACAATGGCGT CTTTCCCTTT
ACCCTGGGAG GTATCGTCAG CATTCTGCTC GGCGGCTATT ATAAGTGCCC AAATGTGAAG
ATAGATTGTT ATGAGGTGCT GACGCACAAA CCACAGGCAG GCGCCTACCG CGCTCCTGGA
GCACCAACCG CCACCTTTGC CATCGAGTCG ACCATCGACG ATATTGCCCG CGCGCTCGGG
CGCGATCCGC TCGCATTTCG GCTGCAAAAC GCTGCCGAAA CCGGCGATCC GATGGGCAAT
AACGATCCCT GGCCCCCTAT TGGGCTGAGA CTGGTGCTTG AGCGGCTACG CGACCATCCC
GCATGGAAGG ATCGAGAGGT CGGTCCCAAC GAGGGTGTCG GCATTGCCGT TGGCGGATGG
CCCTGTGGCA TGTCACCCGC TGCTTCTGTC TGCCGCGTCG ATACCGATGG GACTGTGCGC
GTCCACGTTG GATCGGTCGA TATTTCCGGC GTCAATTCGT CGCTTGTGCT GGTGGCTGCC
GAGATTCTCA ATATTCCGCC CGAACAGGTG GAACTGATTC AAGGCGATAC GCGCAGCGGT
CCCTTTGCCG GTCCGTCTGG CGGCAGCCAG ACAACCTACA GCGTAGCGGG GGCGGTTGCG
AGTGCAGCGC GCGCCGTGCG CGAGAAATTG TTCCATGTAG CGGCAGACCA CTTCGAAGCC
AGCGCCGCCG ACCTCGAACT TCGGAACGGC ATGGTGAGCG TCAAAGGCTT CCCCGACAAA
GCGATCTCGA TTGGCGAACT GGCCGCCATT GCCGAGAGCA AGGCTGGCGG ACCAGGACCG
ATCGTTGCCG AGGGCAGCGC CGCCGTTTCA GAAAATGCAC CCGGTTTCGT GGCCCATCTG
GCGAAGGTGC ATGTCGATCC CGAGACTGGA CAGGTGACGT TAAAACAGTA CGTTGCCATT
CAGGATGTCG GATTTGCCCT CAATCCGACG ATGGTTGCCG GTCAGATCCA TGGCGGCTCG
GTACAAGGCA TTGGCTGGGG ATTGTACGAA GCAATGGTAT ACGACGAGTA CGGTCAGTTG
CTGACTGCCA GTTTCATGGA CTACAACCTG CCGGCGTTCG ATCAGGTCCC AGATATCGAG
ACAGTTCTGG TTGAAAATCC CTCGCCGCAT GGTCCCTTCG GCGCACGCGG TGTCGGTGAG
CCGCCGATCA CGGCTGGCGC AGCAGCGATT GCCAATGCCA TCCGCGATGC CACCGGCGTG
CGCGTCACCG AGATTCCCAT TCGCGCAGAA ATGCTGTGGC GGGCAATACA AGTTGGGGCA
GGTCTGAGAC CTGCCCCGAC GCCTATGGGC AGGGACTGA
 
Protein sequence
MSHNGSTFRY LGKGHRLIEG LEKVTGNAKY AGDLKISGML HACLVLSPYA HARIVSIDAS 
AARAMPGVVA VLTADDLPTR DRAVNSRHSA VLAKDRVLWR GQPVVAVVGE TETAARDAAD
RVVVEYEPLP PVVDVRKAAS PDAPVIWTEG LPKEGADLTA AHAAVDKGEQ ETTGAPSNIH
DEVHFARGDV ERGFTEADVI IERVYCTPMV HQGYLEPHAS VAEPDPYRGG VTVYASTQGQ
FSVRDEVARL LSLPRHKVRV VPMTIGGGFG AKYGIIDPLV AALAVTVKRP VRLVLTRTED
FLSTTPSPAA IVELKVGARA DGALTAIQAR VLMDNGVFPF TLGGIVSILL GGYYKCPNVK
IDCYEVLTHK PQAGAYRAPG APTATFAIES TIDDIARALG RDPLAFRLQN AAETGDPMGN
NDPWPPIGLR LVLERLRDHP AWKDREVGPN EGVGIAVGGW PCGMSPAASV CRVDTDGTVR
VHVGSVDISG VNSSLVLVAA EILNIPPEQV ELIQGDTRSG PFAGPSGGSQ TTYSVAGAVA
SAARAVREKL FHVAADHFEA SAADLELRNG MVSVKGFPDK AISIGELAAI AESKAGGPGP
IVAEGSAAVS ENAPGFVAHL AKVHVDPETG QVTLKQYVAI QDVGFALNPT MVAGQIHGGS
VQGIGWGLYE AMVYDEYGQL LTASFMDYNL PAFDQVPDIE TVLVENPSPH GPFGARGVGE
PPITAGAAAI ANAIRDATGV RVTEIPIRAE MLWRAIQVGA GLRPAPTPMG RD
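
The protein sequence above follows from the gene sequence under translation table 11. The gene begins with GTG, which normally encodes valine but is read as methionine when it is the initiator codon. The sketch below is a minimal illustration and not the tool used to build this record. It assumes the input is the coding strand in frame, treats only ATG, GTG, and TTG as start codons (a simplification), and uses the hypothetical function name `translate_cds`.

```python
# Standard codon assignments, which table 11 shares; codons are ordered
# with each base position cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a reduced set of initiator codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGTCTCACA ATGGATCAAC GTTTCGCTAC"))  # MSHNGSTFRY
```

The output matches the first ten residues of the protein sequence.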