Gene Arth_2472 details

Gene Information

Locus tag: Arth_2472
Symbol:
ID: 4445061
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2769071
End bp: 2770255
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 68%
IMG OID: 639690287
Product: DNA protecting protein DprA
Protein accession: YP_831951
Protein GI: 116671018
COG category: [L] Replication, recombination and repair; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake
TIGRFAM ID: [TIGR00732] DNA protecting protein DprA
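
The Start bp, End bp, Gene Length, and Protein Length values listed above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive (the record does not state this) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2769071
END_BP = 2770255
GENE_LENGTH_BP = 1185
PROTEIN_LENGTH_AA = 394


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1185
print(protein_length(GENE_LENGTH_BP))  # 394
```

Under these assumptions both computed values agree with the Gene Length and Protein Length reported in the record.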


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.156193
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGATC ATGAACGCAT AGCCCGTGCC GCCCTGTCGC GCCTAATGGA GCCGCAGGAC 
GCCGTCGGCC TGGCACTTGT CAGGACCGCC GGGGCAGTGG ACGGGCTCCG AATCGCCACA
GGCCAACTGG TGTCCGGTCC GCAGCTTGAA CAGGAGGTCA CGGCTTTGCT TGCGGAGAAC
GGGACAGCGA GCTGGCCCGG AATGAGCGCT TCGCTGAGGC GCTGGGCCCC GCGAATCCCG
GACCTCGCTC CGGAGCGGGA TCTCGCCACC ATGCACCGGC TGGGCGGCCG GATGATCATG
CCGTCCGATT CACTATGGCC CGGGCAGTTG GCCGACCTGG ACCTTCATGA GCCCATCTGC
CTATGGTGGC GCGGAACGGA ACACCCACTG CCTGCGGCGG CCAAGTCCAT TGCCTTGGTC
GGTTCCCGCG ACAGCACAAG CTATGGCGCG GCCGTCACGG GTGACTTGGC CTATTCGTTG
GCGCAGCGGG GCTTTACCAT CGTGTCGGGC GGGGCTTACG GGATCGATGC GCACGCCCAC
CGGGCCGCGC TGGCCGGTGC CGGCGACGCG ATGCCCACAA TAGCCGTCAT GGCCGGGGGA
GTGGACCGCT TCTACCCGTC CGGCAACGAA GAACTGCTCA GGACCGTCGC GAACCAGGGT
GCAGTCCTGG CTGAAGTACC GCCGGGCTCC GCCCCCACCA GGTACCGGTT CCTGCAACGG
AACCGCCTTA TCGCCGCGCT GTCCTCAGTC ACCGTTGTGG TGGAGGCCCG GTGGCGCTCC
GGTGCACTGA ACACGGCCCA CCACGCGGAG AGCCTGGGCA GGGCCGTCGG TGCCGTTCCC
GGGTCCGTGC ATTCCGCAAA TTCCGCCGGG TGCCACCGGC TGATTCGGGA AGGGGGAGCC
GTCTGCGTCA CGGACGCCGG CGAAATCGCG GAACTTGCCT CTTCCAGCGG CGAATCGTTG
GCAGACGAGG CCCCGGCACA GAGTGCCGAT CATGACGGCC TTACCCTGGA GGACCTCATC
CTCCTCGACG CACTGCCGCT CCGATCCACC AGCTCCGTCG AAAAGCTGAC GTCGGTCGCG
GGACTGAGTA CTGACGCGGT CAGGGCCGGC CTGGGCAGGC TGGGGTTGCT GGGGCTTGCC
GAATCCGAAC GCGGCGGCTG GAAACGGTCC CGGAAAGCCG GCTGA
 
Protein sequence
MTDHERIARA ALSRLMEPQD AVGLALVRTA GAVDGLRIAT GQLVSGPQLE QEVTALLAEN 
GTASWPGMSA SLRRWAPRIP DLAPERDLAT MHRLGGRMIM PSDSLWPGQL ADLDLHEPIC
LWWRGTEHPL PAAAKSIALV GSRDSTSYGA AVTGDLAYSL AQRGFTIVSG GAYGIDAHAH
RAALAGAGDA MPTIAVMAGG VDRFYPSGNE ELLRTVANQG AVLAEVPPGS APTRYRFLQR
NRLIAALSSV TVVVEARWRS GALNTAHHAE SLGRAVGAVP GSVHSANSAG CHRLIREGGA
VCVTDAGEIA ELASSSGESL ADEAPAQSAD HDGLTLEDLI LLDALPLRST SSVEKLTSVA
GLSTDAVRAG LGRLGLLGLA ESERGGWKRS RKAG