Gene Rcas_2978 details

Gene Information

Locus tag: Rcas_2978
Symbol:
ID: 5540470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 3860978
End bp: 3862975
Gene Length: 1998 bp
Protein Length: 665 aa
Translation table: 11
GC content: 60%
IMG OID: 640895096
Product: excinuclease ABC, C subunit
Protein accession: YP_001433053
Protein GI: 156742924
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
TIGRFAM ID: [TIGR00194] excinuclease ABC, C subunit
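
The coordinates and lengths listed above are mutually consistent. The short Python sketch below shows the arithmetic. It assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length), and that the final codon is a stop codon that contributes no amino acid.

```python
# Values copied from the Gene Information fields above.
start_bp = 3860978
end_bp = 3862975

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1998 bp
print(protein_length, "aa")  # 665 aa
```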


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000426335
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCGATC TCATGCATAC GACGGTTGCC GATCCAATCG CCCTTGAAGA ACGGTTGCGC 
GCCGTGCCGC TTGCGCCTGG GGTGTACCTC TGGAAGAACG CCCAGGGGAA AATTATTTAC
ATCGGTAAGA GCAAACGGCT GCGTGATCGT ATGCGTTCCT ACTTTGCCCG CACCGACGAT
CCTTACGGCA AAACGGCGCG GCTTGTGGCG CAGATCGCCG ATTTCCAGGT GATCGTGACG
TCCAACGAGC TTGAGGCGTT GCTGCTCGAA ATGACCCTGA TCAAGCAGCA CCGACCGCGC
TTCAACGTTT TACTGAAGGA CGACAAAAGC TACCCGTACA TCAAGGTGAC GCTGCACGAG
CCGTGGCCCC GCATTGTTGC GACGCGCAAC CCGCGTTGGG AAGAGGGGGC GCGCTACTTC
GGACCGTACT CCAGCGCCGG CGCCGTCTAT CGCACGCTCG ACCAACTCAA CCGGCTGTTC
GCCTTTCGCC CGCCGTCGCG TTGCCCTGAC GATAAGTTCA ACCGCCATCG GCGGCTCGGC
AAACCGTGCC TGTACTACGA CATCAAGCGG TGCCTGGGTC CGTGCGTGCC GGGTCTGGTC
AATCAGAACG ATTATCGCGC AACGGTTGAA TCGGTCTGTC GCTTCCTCGA AGGCAAGAGC
GACCTGGTGG CGAAAATATT GCGCCGTCAA ATGGAAGAAG CGGCGGAACG GCTCGACTTC
GAGCGCGCCG CGCGACTGCG CGACAGCATC CGCGACATTG AACTGATCGG TCAGCGGCAA
CAGGTGTTAC GCCACGATGA CGCCGACCAG GATGTCATTG GACTGGCGCG CGAGGAGGGG
ATGGCAGTTG TTCAGGTGCT CCGCATTCGC GCAGGAAAAT TGATCAGCGC CGAGTCGTTC
CCGTTGCAAA ATGCCGAAGG GGAACGTGAT GAATCTCTGC TGGCTTCGTT CCTGACGCAG
TTCTACGATG CTGCCGCCGA ACTTCCGGCG ACACTGCTGC TGCCTGCGCC ACTCGACGAC
CTGGCAATCA TCGAGCAATG GCTGGCGCAA AAAGCCGGGC GCAAAGTTGC GCTGCACACA
CCGCAACGTG GTGAAAAGCG CCGCCTCGTC GAACTTGCCG AGCAGAACGC TCGCCAGAAA
CTCGATGAAT TGCGTCTGCA ATGGCTCAAC AGCGAACAAC GCGCCGTGGC GGGGTTGACC
GAAGTGCGCG ATCTGCTCGG TTTGAGCGCA CTGCCGACAC GCATCGAGTG CTTCGATGTT
TCCAACACGC AGGGCAGCCA TTCGGTTGGG GCAATGGTCG TCTTTGAGCA TGGTGAACCG
AAGAAGAGCC GCTACCGCAA ATTCAGGATC AAAACCGTTG AGGGCGCGAA CGATGTCGCT
TCGATCCAGG AAGTGCTGCG GCGGCGTTTC CGGCGTGCTG CCATGGTCAT CGGCGAAGAG
GAACAACCGG CGGACGAGCG CGTTGTCAAC GGTCAGACCG ATGCCGCAGA GCAGGAAGAT
GGGGAGAAGA CCGATGCGCC GGGATCACAG TCCGACCTCG AACGCCAGGA GACCTGGGCT
GAACTGCCCG ACCTCATCCT GATCGACGGC GGCATTGGCC AGGTGAATGG CGCATTACAC
GTGCTGCGCG ACCTGTGCTT CGAGCATATT CCGGTCGTTG GAGTCGTCAA GGGTCCGAAC
CGTGACCGCT TCGATCTGCT GATCCCCGGC GCGAGCGATC TCATCGTTCT CGAGCGCGAG
AGCGCCGCGT TGCGTCTTAT CCGGCGGATT GACGAAGAAG CCGACCGTTT TGCGAAAGAT
TATCACCGCA AACTGCGCAG CAAATCGGCG ACCGCGTCGC GCCTGGAAGA GATCCCCGGC
ATCGGCCCGA AGCGGCGCCA GTTGCTGCTC AAACGCTTTG GCTCACTCGA CGGCATTCGC
AACGCAACCG TTGACGAAAT CGCCGCCGTA CCGGGCATGA CGCGCAAGGC GGCTGAGGAG
TTGAAGAGCC TGTTGTAG
 
Protein sequence
MSDLMHTTVA DPIALEERLR AVPLAPGVYL WKNAQGKIIY IGKSKRLRDR MRSYFARTDD 
PYGKTARLVA QIADFQVIVT SNELEALLLE MTLIKQHRPR FNVLLKDDKS YPYIKVTLHE
PWPRIVATRN PRWEEGARYF GPYSSAGAVY RTLDQLNRLF AFRPPSRCPD DKFNRHRRLG
KPCLYYDIKR CLGPCVPGLV NQNDYRATVE SVCRFLEGKS DLVAKILRRQ MEEAAERLDF
ERAARLRDSI RDIELIGQRQ QVLRHDDADQ DVIGLAREEG MAVVQVLRIR AGKLISAESF
PLQNAEGERD ESLLASFLTQ FYDAAAELPA TLLLPAPLDD LAIIEQWLAQ KAGRKVALHT
PQRGEKRRLV ELAEQNARQK LDELRLQWLN SEQRAVAGLT EVRDLLGLSA LPTRIECFDV
SNTQGSHSVG AMVVFEHGEP KKSRYRKFRI KTVEGANDVA SIQEVLRRRF RRAAMVIGEE
EQPADERVVN GQTDAAEQED GEKTDAPGSQ SDLERQETWA ELPDLILIDG GIGQVNGALH
VLRDLCFEHI PVVGVVKGPN RDRFDLLIPG ASDLIVLERE SAALRLIRRI DEEADRFAKD
YHRKLRSKSA TASRLEEIPG IGPKRRQLLL KRFGSLDGIR NATVDEIAAV PGMTRKAAEE
LKSLL
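
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal Python sketch, not the database's own pipeline. Translation table 11 assigns amino acids to codons in the same way as the standard genetic code (it differs in which start codons are allowed), so the sketch builds the standard codon map. The function name `translate` and the variable names are illustrative. Only the first and last lines of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base grouping used on this page) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above; both start in frame,
# because every full line holds 60 bases (20 codons).
first_line = "ATGTCCGATC TCATGCATAC GACGGTTGCC GATCCAATCG CCCTTGAAGA ACGGTTGCGC"
last_line = "TTGAAGAGCC TGTTGTAG"

print(translate(first_line))  # MSDLMHTTVADPIALEERLR
print(translate(last_line))   # LKSLL
```

The two outputs match the first 20 and the last 5 residues of the protein sequence listed above. Passing the whole gene sequence to `translate` in the same way should yield the full protein.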