Gene Arth_1842 details

Gene Information

Locus tag: Arth_1842
Symbol:
ID: 4445636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2071756
End bp: 2072919
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 64%
IMG OID: 639689660
Product: integrase catalytic subunit
Protein accession: YP_831332
Protein GI: 116670399
COG category:
COG ID:
TIGRFAM ID:
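
The start, end, and length values listed above are mutually consistent, which a short check can confirm. The sketch below assumes the coordinates are 1-based and inclusive (the convention the listed values imply) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count of a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2071756, 2072919)
print(length_bp)                  # 1164
print(protein_length(length_bp))  # 387
```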


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACAGCGT TCTGTGCCGA GCATGGCATC TCCCGTAAGA CGTTTTACGT GTTGTTGGGC 
CGGGCCCGGG CCGGGGGCCC GGCTACAGCG TTGGAACCGC GGTCCCGTCG GCCGCGCACG
AGCCCGTCCA GGATCAGTGA TGAGGCCAAG GAGCAGGCAC TTCGGGTGCG GGGAGCGTTG
GAGCGCTCGG GCCTGGACCA CGGACCGATC AGCGTGTTTG AGAAGATGAA GTCCATGGGC
CTGGAACCGG TTCCCTCGGT TGCGTCGCTG GCCAGAATCT TCCGTCAAAG CGGTGTCGCG
AGGCTGGAGC CGCGGAAGAA ACCCCGGGCC GCGTACCGCC GGTGGCAGCT GGATGCGACC
GAGTACGTCC TCACCGGTGG CCGTAAATGC GTGATTTTCC AGCTCATCGA TGACCACTCC
CGTTTCGCGG TCGCCTCCCA CGTTGCCGCC GGGGAAACGT CCGAGGCCGC GATCGCGGTG
GTGAAGAAGG GCATCACCGC GCACGGGGTA CCGCAGAAAC TGCTCACGGA TAACGGGGCC
GCGTTGAACC CTTCACGGCG GGGCCATCAA GGCCAGCTCC TCACGTGCGT CACCTCCCTG
GGGATCGAGG CGGTCACCGG GAAACCTTAC AAACCAACCA CGCAGGGCAA GAACGAACGC
TTCCATCAGA CCCTATTCCG GTTCCTGGAC AAACAACCCC TCGCCAGGAC AATTGAACAG
CTTCAGGAGC AGGTCGGAGC GTTCGATCAG CTCTATAACA CCGAACGCCC GCACCAGGGT
TTGCCCGGGC GGATCACCCC GCGCCAAGCA TGGGCGGCCA CACCGGTCGC CGAGCCACCA
CACCCGAAAC CTGTCCCGGC CCTCACACAG GACAGGACCC GCGGCAGCGG CCAGGCCACC
CGCATCGCCT ACCCCAACGG CAGAGTCACG ATCAACAGCG TGGTCTACAT GATCGGCAGA
CCCTACGCCC GGCACTGCAT CCACGCGCTC TGGGACACCG AGATGATCCA GTTCTTCGAT
GACCAGGGCA CCCACATCAC GTCCTACCCC TGCCCGCCAG CCGGGACCAA ATCAGTCGGC
AACGGCAAGC CCCCAGGACG CACCATGAAA CAACCCCCAA CCGTCACCGA AGTCCTGACA
CACGACATGT CACCGATCTC CTGA
 
Protein sequence
MTAFCAEHGI SRKTFYVLLG RARAGGPATA LEPRSRRPRT SPSRISDEAK EQALRVRGAL 
ERSGLDHGPI SVFEKMKSMG LEPVPSVASL ARIFRQSGVA RLEPRKKPRA AYRRWQLDAT
EYVLTGGRKC VIFQLIDDHS RFAVASHVAA GETSEAAIAV VKKGITAHGV PQKLLTDNGA
ALNPSRRGHQ GQLLTCVTSL GIEAVTGKPY KPTTQGKNER FHQTLFRFLD KQPLARTIEQ
LQEQVGAFDQ LYNTERPHQG LPGRITPRQA WAATPVAEPP HPKPVPALTQ DRTRGSGQAT
RIAYPNGRVT INSVVYMIGR PYARHCIHAL WDTEMIQFFD DQGTHITSYP CPPAGTKSVG
NGKPPGRTMK QPPTVTEVLT HDMSPIS
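
The protein sequence follows from the gene sequence under translation table 11, in which an alternative start codon such as GTG is read as methionine at the first position. The following is a minimal sketch of that translation, assuming the input uses the space-separated 10-base blocks shown above. `clean` and `translate_cds` are hypothetical helper names, and the start-codon set is the one NCBI lists for table 11.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for table 11, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Codons that table 11 accepts as translation starts (all read as Met).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS from its start codon up to the first stop codon."""
    dna = clean(seq)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # the start codon is always read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACAGCGT TCTGTGCCGA GCATGGCATC"))  # MTAFCAEHGI
```

Passing the full gene sequence block to `translate_cds` should reproduce the protein sequence listed above, since the gene ends in the stop codon TGA.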