Gene RoseRS_3438 details


Gene Information

Locus tag: RoseRS_3438
Symbol:
ID: 5210415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4311519
End bp: 4314641
Gene Length: 3123 bp
Protein Length: 1040 aa
Translation table: 11
GC content: 61%
IMG OID: 640597033
Product: excinuclease ABC, A subunit
Protein accession: YP_001277746
Protein GI: 148657541
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
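
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4311519, 4314641)
print(length, protein_length(length))
```

With the values from this record, the computed lengths agree with the listed Gene Length (3123 bp) and Protein Length (1040 aa).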


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.160486
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAAG ACGCCATCGT TATCAAGGGT GCGCGTGAGC ACAATCTGAA GGGCATCGAC 
CTCGAAATCC CACGCGACAA ACTGGTTGTG CTGACCGGCG TCTCAGGTTC GGGAAAGTCG
TCGCTGGCGT TCGATACGCT GTATGCCGAA GGACAGCGCC GGTACGTCGA GTCGCTCTCG
GCATACGCCC GTCAGTTTCT CGGGCAGATG GAGAAGCCGA AAGTCGATTA CATCGGCGGT
CTCTCGCCGG CGATTGCCAT CGAGCAGAAG AGCGCGTCGA AGAATCCGCG CTCGACCGTC
GGCACCGTCA CCGAGATCTA CGACTATCTG CGCCTGCTGT ACGCTCGCGT TGGAACGCAG
CACTGCCACA TGTGCGGGCG ACCGGTCAGT TCGCAAAGCG CCGAGCAGAT CGTCAACCGC
GTGCTGACGT TGCCGGCTGG CACACGCTTC ATGGTGCTGG CGCCGCTCGT GTCGCAACGC
AAAGGCGAGT ACAAGGACGT GTTCGCCGAA GCGCGCGCCG AGGGATTCGC GCGGGTGCGC
GTCGATGGCG AAATACGCGA CCTGGCAAGC GAAATCAAAC TCAACAAGAA GGTCAAGCAT
ACTATCGAGA TTGTGGTTGA TCGTTTGACC ATACTGGCAC GCGAAGGTGC AACCGATCAG
TCGCATGTTC CGAGCGCCCC GATCGGAAAA GCGCAGGCAG GGGCGCAGAG CGATTGGGAT
GCATTCGTCT CTCGCCTGAC CGATAGTGTC GAACAGGCGC TGCGTGTTGG CGAGGGGCAA
CTGGTTATCA GCATCCAGAA TCCATCCGGT GGCACAGAAG AATGGTTGAT GAGCGAAGCC
AACACCTGCG TCCACTGTGG CATTTCGTTT CCTGAACTGT CGCCGCAGAT GTTTTCGTTC
AACAGTCCGC AGGGCGCCTG CCCCGAATGC ACCGGTCTCG GCGTTCGGAT GGAGGTGGAC
CCGCTGCTGC TCGTGCCCAA CCCATCATTG ACCCTGCACG AGGGTGCGGT GACCTACTGG
GGCGAACTGC GCAAGAAACG CGATTCGTGG GGGTACCGGG CGTTGCTGGC AATTGCGCAT
CACTACGGCT TCGATCTCGA TACGCCATGG GAACAACTCA GCGAGCAGGC ACGCCACGTC
ATTATCTATG GCAGCGGAAA GGAACGAATT CGCTTTCGCT GGGGCGACGA AACCAGTGAT
AGTCGTGGTG AGTTCATGCG TCCCTGGGAG GGACTGGCAA GTGAAATTCG TCGCCGCTAT
CAACAGACCG GCAGTGATTA CACCCGCGAG TATTACCAGA GTTTTATGAG CGAACAACCC
TGCCCGGCAT GCGACGGTGC GCGTCTGCGC CCCGAAAGCC TGGCGGTCAG GGTCGGCGGG
TGGTCGCTGC GCGATGTGAC GCGCCTGACG ATTACCGGTG CGATGGCATG GGTGCACGCC
CTGAGCGGCA TGCCGGTAGA CCCGTCGCAT CTGGCGGCAT TGAACGGGCA TGTCGCAGGC
AATGGCGCCA TACCCCACCT CAGTGTGACA CCACTGAGCG ACTACCAGAT GGCGATTGTC
AGCGATGTGC TCAAGGAGAT CCGCGAGCGC CTGGGGTTCC TGCTCAATGT CGGTCTGCAT
TACCTCACCC TCGAACGCCC CGCGCCAACC CTCTCCGGCG GCGAGGCGCA GCGTATTCGC
CTGGCATCAC AGATCGGCTC CGGTCTCGTC GGCGTCACCT ATATCCTCGA TGAACCGAGC
ATCGGGCTGC ACCAGCGCGA CAATCGCAAA CTGCTCGATA CGCTGCTGAA ACTGCGCGAT
CTGGGCAATA CCGTCGTCGT CGTCGAGCAC GACCTGGAAA CCATGCAGGC GGCTGACTGG
ATCATCGACT TCGGTCCTGG GGCAGGCGTC AAGGGGGGCG AGGTGGTCGC AGCCGGTCCT
CCTGACCTGA TCGCCGCAAA CCCTGGCTCC CTGACCGGCG CATACCTGTC CGGGCGATTG
GACATTCCGA TCCCGCAGCA GCGCCGCACT GCGCGGGTGC GCCCGGTTGC CGATACGGCG
CAGGACGCGC CGCGCCGTCG TCGCCGGACT GATCACGCAA CCGACCAGGC GGATGGTCCG
TGGCTCGAAC TCGAAGGCGC AACCATGAAT AATCTGCGCG ACGTGACCGT TCGTTTTCCG
CTCGGCGTCT TCATCTGCGT GACCGGCGTC TCCGGGTCGG GAAAATCATC GCTGATCACC
GAAACGCTCT ACCCTGCGCT GGCAAACCGC CTGAACCGCG CGCAGTTGAA GCCGGGACCG
TTCCGTACAT TGCGTGGGCT GGAACATCTC GATAAGGTGA TCGATATCGA CCAGCAACCG
ATTGGGCGAA CCCCGCGCTC CAACCCGGCA ACATACGTCA AACTGTTCGA CCTGATCCGC
GAACTGTTCG CTTCGACCAA TGAGGCGAAA CTGCGCGGCT ACAACGCCGG GCGCTTTTCG
TTCAACCTGA AGGGCGGGCG TTGCGAAGCC TGCGAGGGGA ATGGCGAAAA GCGCATCGAC
ATGCAGTTCC TGGCGGATGT CTGGGTGCGC TGCGATGTCT GTAAGGGGAA ACGGTACAAC
CGTGAAACAT TGCAGGTCAG GTACAAGGGC AAGTCCATTG CTGACGTGCT CGACATGGAC
GTGCAGACGG CGCTGGAGTT CTTCGACAAT GTGCCGCGCA TCAGGCGCAT CCTGCAAACG
CTCCACGACG TCGGTCTGGA CTACATCAAA CTCGGTCAGT CGGCGACGAC CCTTTCCGGC
GGCGAGGCGC AGCGGGTGAA ACTGGCGAAA GAACTGGCGC GCACTGCTAC CGGTCGCACC
ATGTATATTC TGGATGAACC AACGACCGGG CTGCACTTCG CCGATGTACA ACGCCTGTTG
ACAGTGCTGC ACCGCCTGGT CGATGCAGGC AACACCGTGC TCGTCATCGA GCACAACCTG
GACGTTATCA AAACCGCAGA CTGGATCATC GACATGGGAC CGGAAGGCGG CGACGGGGGT
GGCAGAGTCG TGGCGACCGG CACACCCGAA GAAGTGGCGC TGATCGAGGA GTCGCACACC
GGTCGATTCC TGCGCGAGAT CCTGCACCAC CACAACATCG TTGCCAGGGG CGTGCTTGAG
TGA
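
The gene sequence above is printed in space-separated blocks of ten bases. The following is a minimal sketch of how such a listing could be cleaned, its GC content computed, and the result translated with the bacterial code (translation table 11, as listed in the record). It is not the database's own method; the helper names are hypothetical, and the codon table is built from the standard NCBI ordering, whose amino acid assignments table 11 shares with the standard code.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino acids of translation table 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGGCAAAAG ACGCCATCGT TATCAAGGGT"))  # MAKDAIVIKG
```

The printed peptide matches the first ten residues of the protein sequence given in this record. Passing the full gene sequence to the same functions would allow the listed GC content and protein length to be checked in the same way.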
 
Protein sequence
MAKDAIVIKG AREHNLKGID LEIPRDKLVV LTGVSGSGKS SLAFDTLYAE GQRRYVESLS 
AYARQFLGQM EKPKVDYIGG LSPAIAIEQK SASKNPRSTV GTVTEIYDYL RLLYARVGTQ
HCHMCGRPVS SQSAEQIVNR VLTLPAGTRF MVLAPLVSQR KGEYKDVFAE ARAEGFARVR
VDGEIRDLAS EIKLNKKVKH TIEIVVDRLT ILAREGATDQ SHVPSAPIGK AQAGAQSDWD
AFVSRLTDSV EQALRVGEGQ LVISIQNPSG GTEEWLMSEA NTCVHCGISF PELSPQMFSF
NSPQGACPEC TGLGVRMEVD PLLLVPNPSL TLHEGAVTYW GELRKKRDSW GYRALLAIAH
HYGFDLDTPW EQLSEQARHV IIYGSGKERI RFRWGDETSD SRGEFMRPWE GLASEIRRRY
QQTGSDYTRE YYQSFMSEQP CPACDGARLR PESLAVRVGG WSLRDVTRLT ITGAMAWVHA
LSGMPVDPSH LAALNGHVAG NGAIPHLSVT PLSDYQMAIV SDVLKEIRER LGFLLNVGLH
YLTLERPAPT LSGGEAQRIR LASQIGSGLV GVTYILDEPS IGLHQRDNRK LLDTLLKLRD
LGNTVVVVEH DLETMQAADW IIDFGPGAGV KGGEVVAAGP PDLIAANPGS LTGAYLSGRL
DIPIPQQRRT ARVRPVADTA QDAPRRRRRT DHATDQADGP WLELEGATMN NLRDVTVRFP
LGVFICVTGV SGSGKSSLIT ETLYPALANR LNRAQLKPGP FRTLRGLEHL DKVIDIDQQP
IGRTPRSNPA TYVKLFDLIR ELFASTNEAK LRGYNAGRFS FNLKGGRCEA CEGNGEKRID
MQFLADVWVR CDVCKGKRYN RETLQVRYKG KSIADVLDMD VQTALEFFDN VPRIRRILQT
LHDVGLDYIK LGQSATTLSG GEAQRVKLAK ELARTATGRT MYILDEPTTG LHFADVQRLL
TVLHRLVDAG NTVLVIEHNL DVIKTADWII DMGPEGGDGG GRVVATGTPE EVALIEESHT
GRFLREILHH HNIVARGVLE