Gene Arth_0977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0977 
Symbol 
ID4446154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1051193 
End bp1054384 
Gene Length3192 bp 
Protein Length1063 aa 
Translation table11 
GC content63% 
IMG OID639688783 
Producttype III restriction enzyme, res subunit 
Protein accessionYP_830474 
Protein GI116669541 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTCTTTC CCTCACCCAC CAAATACCCT GGCGCAGTGA CTAATTCCAA TGGGGGAGTG 
GCGGCGGAGA AGCTGCCCGA GGGCTTGTAT GAACTACTGA ACACGAAACT ACTTGGTCAG
CGCCTTGACC AGGAAGTCGA ACTCCAGCCA ACCTTCGCTG ACATTGACAA TGAAGACGCT
CCGGACATTC TTTCTCGCCA CGTCGTCGAC GCCGTCCGGC AGGCCCTCTC CTCAGCAAAG
CCATCGGACA GGGTGGCTAT AGCAAACCGA CTTCTTCTAG GGCTGAAGCA CAGTGACCGA
ATCGCAGACG GCCCCACTCA GCTTCAATCC CTCCATCGCC CGGACGCCCT TAAGCGCCGC
CAACTCCGCC GTCCCAGCAC CAAGCTGAGT GACTCGGCGC TGCTGACCAA CAGCAAGGAC
GAGCCGAATC TCGCCGCGGA GCTCCGGGCC GAGATCGAGT CCGCCAACTC CGTGGACCTA
CTCTGTGCCT TCGTCCGATG GACGGGCCTC CGGCTGCTCG AACCCGCCCT TGAGCAACTC
AAGGAGCGGG GAGTCCGCCT GAGAGTCCTC ACCACCACGT ACATGGGCGC CACGGAGCGC
CGCGCTATCG ACGAACTCTT CAACCGCTAC GGCGCCGAGG TGAAGATCAA CTACGAAACT
CAGGCCACTC GCCTCCACGC AAAGGCCTGG CTGTTCCGCC GCAACTCCGG CTTCGACACC
GCGTACGTGG GGAGCTCCAA CCTCAGCCAA GCTGCCCTCC TCGATGGTCT TGAATGGAAC
GTCAGGCTCA GTTCCGTCGG CACGCCGACT CTCCTGCAGA AGTTCGAGGT CACGTTCGAT
AGCTACTGGG AGCAGCGTGC TTTCCAGAGC TATGATCCAG AACGCGACGG CGAGAAGCTC
GATGCTGCCC TGGAGCGCAA CGGCGGCCGG CGCACCGCAG CGCCGGACGC AGCCATCGGG
CTCGAAGTCC AGCCGTACCT CCACCAGGAG GAAATGCTTG AGGAGCTGGA AGCAGAGCGG
CTCAAAGGCT TCAACCACAA CCTGCTGGTC GCCGCCACCG GCACCGGCAA GACTGTCATC
GCTGCCCTCG ACTACAAACG CCTGTGCGAG GCTGCCGGCC GCGATCTGAA GCTGCTCTTC
GTCGCCCACC GGCAGGAAAT CCTGAAACAG GCGATGCGCA CGTACCGTGA CGTCATGCAG
GACGGCGCCT TCGGCGAACT CTACGTCGGC GAGCACAAAC CCGCGCAGTG GAAGCACATC
TTCGCCTCCG TCCAGTCGCT GTCCTCGCTC GGCATCGAAC AGCTGGAGCC AGATTTCTTC
GATGTAGTCG TGATTGATGA ATTCCACCAC GCCATGGCAC CCACCTACCG GCGTCTGCTG
GACCACCTGC AGCCGCAACA GCTTCTCGGA CTGACTGCGA CGCCGGAACG CGGCGACGGT
GTCGATGTCG CCAAGCAGTT CTTCGACGGG CGCACGGCGA GCGAATTAAG GCTTTGGGAC
GCCTTGGACG CTGACCTGCT GGTGCCGTTC CACTACTTCG GAGTCTCCGA CGACGTAGAC
CTGAGCCAGC TGGAATGGAA GCGCGGCAAC TATGACACCA CTCAGCTGAA CAATCTCTAC
ACCGGCAACG ACGCCCGCGC GGCCAAGGTG ATCCGCGAAC TCCGCGACAA AGTCACTAGC
ACGGACCAGA TGCGGGCCAT CGGCTTCTGC GTCTCGGTCC AGCATGCCCA TTACATGGCC
GAAGTGTTCA ACCGGGCAGG TATCGCTTCC GTTGCGGTCG ACGGCAGCAC CGACGACGCC
GACCGCGCAG CGGCGCTGGA GCGCCTCCGC CGGCGGGAGA TCAACTGCAT CTTCGCCGTG
GACCTTTACA ACGAAGGCCT GGACCTGCCG CAGGTGGACA CCATCCTGCT GCTCCGGCCC
ACGCAGAGCG CCACGATCTT CCTCCAGCAG CTGGGGCGTG GGCTGCGCCG TGCCGAGGGC
AAGGCCGTGC TGACCGTGAT GGACTTCATC GGCCAGCAAC GGCGTGAGTT CCGCTTCGAC
CTGCGCTACC GGGCTCTGAC CGGCTACAGG CGCAAGGAGC TGGAAAAGGC TGTCGAGGAC
GAGTTCCCCT ATCTGCCATC GGGCTCGCAG ATCGTGCTAG ACCGGGTGGC GCAGAAGGTG
GTGCTGGACA ACATCAAGGC ACAGCTGCGG TTCAACCGTG CACAGCTGGT CCGGGACATT
GCCTCATACG CGGAGACCGA GTTGGAGGCT TATCTGGAGC AGTCCCGGAA CGACGTGAAG
TCGATCTACC GTTCGACCAG GGATTCCTGG ACCGGTTACC TTCGCCAGGC GGGGCTGATC
GAGGGTTTCT CGCCGTTGGA GGCAGTTCTC AGCGGGAGGA TCAAGGAGCT CTCGGATGCC
GATGAGAAGA AGCTCCTGGG CCGGATGGCT GCGCTCATCC ACGTGGACGA CCCGGAACGT
GCTGAGGCGT ACTCGATGCT GGTCAGCCCC GGTGCGCCCC GCTACGCGGA GCTTGGCATG
CGTGAGCAGA CGTTTGCGCG GATGCTGTTC TACACGCTGT GGGACGACGG CGGCGGTTTC
CAGTCGTACG ACGCCGGACT GGACTACCTG CGCGGCTATC AGTTTGTTTG CAGCGAGATC
CGCCAGATCG TGAAGCTCGG GGTGGCCGCC TCCAAGCATG CCGCCAAGGG CTTGGGAGCG
GGGCTACAAC ACGTTCCGCT GCTCTCACAC GCGACCTACC GGCGCGAAGA GGTCCTGGCG
GCCCTGCAAT ATGGATCGCT GGAACTAGGC AAGAACGTGC AACACCGCGA GGGTGTGGCT
TGGTGCCCGG CGACGTCCAC CGATGCCTTC TTCGTCACCC TCAACAAGGA CGACAGGAAG
CACTCGGCGA CGACGATGTA CAAGGACTAC GCCATCAGTC CGGAACTCTT CCACTGGGAG
TCGCAGAACG CGACGTCACC CGGTAGCCCG ACTGGACGCC GATACCTTAA CCGGGAATCC
CATGGTTCGA AAATCTTGAT CTTCACGAGG GACACTTCGG AGGACGAGAC CGGCCTGACG
GTTCCGTACG CGTGCCTCGG GCAGGTGGAC TACGTGCAGC ATGCAGGGGA GAAGCCGATC
GCGATCACCT GGAAACTGCA TCGGCCGATG CCGGCGGACG TGTTTGCGAC GGCGGCAGCT
GTGGCTAACT GA
 
Protein sequence
MVFPSPTKYP GAVTNSNGGV AAEKLPEGLY ELLNTKLLGQ RLDQEVELQP TFADIDNEDA 
PDILSRHVVD AVRQALSSAK PSDRVAIANR LLLGLKHSDR IADGPTQLQS LHRPDALKRR
QLRRPSTKLS DSALLTNSKD EPNLAAELRA EIESANSVDL LCAFVRWTGL RLLEPALEQL
KERGVRLRVL TTTYMGATER RAIDELFNRY GAEVKINYET QATRLHAKAW LFRRNSGFDT
AYVGSSNLSQ AALLDGLEWN VRLSSVGTPT LLQKFEVTFD SYWEQRAFQS YDPERDGEKL
DAALERNGGR RTAAPDAAIG LEVQPYLHQE EMLEELEAER LKGFNHNLLV AATGTGKTVI
AALDYKRLCE AAGRDLKLLF VAHRQEILKQ AMRTYRDVMQ DGAFGELYVG EHKPAQWKHI
FASVQSLSSL GIEQLEPDFF DVVVIDEFHH AMAPTYRRLL DHLQPQQLLG LTATPERGDG
VDVAKQFFDG RTASELRLWD ALDADLLVPF HYFGVSDDVD LSQLEWKRGN YDTTQLNNLY
TGNDARAAKV IRELRDKVTS TDQMRAIGFC VSVQHAHYMA EVFNRAGIAS VAVDGSTDDA
DRAAALERLR RREINCIFAV DLYNEGLDLP QVDTILLLRP TQSATIFLQQ LGRGLRRAEG
KAVLTVMDFI GQQRREFRFD LRYRALTGYR RKELEKAVED EFPYLPSGSQ IVLDRVAQKV
VLDNIKAQLR FNRAQLVRDI ASYAETELEA YLEQSRNDVK SIYRSTRDSW TGYLRQAGLI
EGFSPLEAVL SGRIKELSDA DEKKLLGRMA ALIHVDDPER AEAYSMLVSP GAPRYAELGM
REQTFARMLF YTLWDDGGGF QSYDAGLDYL RGYQFVCSEI RQIVKLGVAA SKHAAKGLGA
GLQHVPLLSH ATYRREEVLA ALQYGSLELG KNVQHREGVA WCPATSTDAF FVTLNKDDRK
HSATTMYKDY AISPELFHWE SQNATSPGSP TGRRYLNRES HGSKILIFTR DTSEDETGLT
VPYACLGQVD YVQHAGEKPI AITWKLHRPM PADVFATAAA VAN