Gene Arth_4215 details

Gene Information

Locus tag: Arth_4215
Symbol:
ID: 4443581
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008539
Strand:
Start bp: 47332
End bp: 48888
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 67%
IMG OID: 639687740
Product: hypothetical protein
Protein accession: YP_829437
Protein GI: 116662384
COG category:
COG ID:
TIGRFAM ID:
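
The start, end, gene length and protein length fields in this record are related by simple arithmetic. The following Python sketch checks that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record for Arth_4215.
start_bp = 47332
end_bp = 48888
gene_length_bp = 1557
protein_length_aa = 518


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)        # coordinates agree with Gene Length
print(computed_protein_length == protein_length_aa)  # CDS length agrees with Protein Length
```

Under these assumptions both comparisons hold for this record: 48888 - 47332 + 1 = 1557 bp, and 1557 / 3 - 1 = 518 aa.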


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCCGCA TGAAACTCTT CACCCGCCCC GCCAAGAAGG CCGCAGCACC GGTCAAGAAT 
GCGCCGAAGA AGAAGGCGGA GAAGGAACAG CGTCAGCCCG GGCCCTCTGC CCGGGGCTGG
ACCGGACGCG GCGGCGGCAT GGCCGCCCTC GTCCCCTCCG TGAAGGAATA CCGGGGAACC
ACGGTCCAGG TGTGCGGGCT GTGGCCGTTT TCCTCCGGGG CCTCCTCACC CATGATCGGC
GTCCCGCTCG GACGCCACGA GGAAACCCAG GCCACGGTCT GCTGTGACCC GATCAGCTGG
TTCCAGCGGG CACGCCTGAT TTCCAATCCG TCTGCGTTCA TCCTGGGCAA GCCGGGGCTG
GGAAAGTCCA CCGTGGTGCG CCGGATGTTC ATCGGTCTCT CCGCTCAGGG CGTCCACCCT
CTGATCCTGG GGGACTTGAA AGGCGAGCAC GTCAAAGCGG TCCAGGCACT CGGCGGACAG
GTCATCAAGC TCGGCCGCGG CGTCGGGTAC CTGAACATCC TCGATCCCGG CCAGGCCGTC
GAAGCGGCGC AGCTGCTCGA AGCCAGCGGC CATCACGAGG ACGCCGCCCG GGTCCGCGCC
GACGCGCACG GCCGTCGCCT GACAATGGTT GTCTCCCTGA TCACGATCAG CCGGAACAAC
CCGCCCACCG ATCAGGAACA GACAATCCTG GACCGGGCCT TGCGGGTCCT CGATGACCGG
TTCGACGGTG TTCCTGTCCT CAAAGACCTC CTGGACGTGA TCATCTCCGC ACCCGACGAG
CTGCGGCAGG TCGCCTTGGA CCGTGGGGAC ATGGCCGTCT ACCTGCAGGA GACCCGGGCA
CTGGAAGCAA CCCTGCTGGG CCTGACCGGT GGCGGGAAGC TGGGGGAAAT CTTCTCCCGG
CACACCACCA ACCCCATGAT GCGCGACCGC GCCGTGGTCT TCGACGTCTC CAGCATTGAC
GAAACCGAAA CCGACCTCCA AGCTGCAATC CTGCTCGCGT GCTGGTCCTA CGGCTTCGGC
ACCGTCAACG TCGCCAACGC CCTCGCGGAC GCCGGCCTGG AACCGCGCCG AAACTACTTC
GTCGTCCTCG ATGAACTCTG GCGGGCGCTG CGCGCCGGTA AAGGCATGGT GGATCGGGTG
GACGCCCTGA CCCGGCTCAA CCGCTCCGTC GGTGTCGGGC AGATCATGAT CTCCCACACC
ATGTCCGACC TGCTGGCACT GCCGGCAGAA GAGGACCGGA TGAAAGCCCG CGGCTTCGTG
GAACGCTCCG GCATGGTCAT CTGCGGTGGC CTCCCGGCCT CCGAGATGCC GCTGCTGACC
TCGGCCATCC CGTTGTCCCG GCAGGAACAG CAGAAGCTCA TCTCCTGGCA GGACCCGCCC
GCATGGGACT CCCGCGGGCT CGACGTCGAA CCCCCCGGAC GCGGGAAGTT CCTGATCAAG
GTCGGCGGCC GTCCCGGCAT CCCCGTCCTG GTGGGCCTGA CGTCCCTCGA ACAGGGCCCG
GACGGAGTCA ACGACACCGA AGGAAAGTGG CATGAGAACG TTGAGAAGTT CGCGTGA
 
Protein sequence
MARMKLFTRP AKKAAAPVKN APKKKAEKEQ RQPGPSARGW TGRGGGMAAL VPSVKEYRGT 
TVQVCGLWPF SSGASSPMIG VPLGRHEETQ ATVCCDPISW FQRARLISNP SAFILGKPGL
GKSTVVRRMF IGLSAQGVHP LILGDLKGEH VKAVQALGGQ VIKLGRGVGY LNILDPGQAV
EAAQLLEASG HHEDAARVRA DAHGRRLTMV VSLITISRNN PPTDQEQTIL DRALRVLDDR
FDGVPVLKDL LDVIISAPDE LRQVALDRGD MAVYLQETRA LEATLLGLTG GGKLGEIFSR
HTTNPMMRDR AVVFDVSSID ETETDLQAAI LLACWSYGFG TVNVANALAD AGLEPRRNYF
VVLDELWRAL RAGKGMVDRV DALTRLNRSV GVGQIMISHT MSDLLALPAE EDRMKARGFV
ERSGMVICGG LPASEMPLLT SAIPLSRQEQ QKLISWQDPP AWDSRGLDVE PPGRGKFLIK
VGGRPGIPVL VGLTSLEQGP DGVNDTEGKW HENVEKFA
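
Both sequences above are displayed in blocks of 10 characters separated by spaces. The following Python sketch shows one way to remove that block spacing and to translate the nucleotide sequence into protein. It assumes the coding assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons are permitted as starts; this gene begins with ATG). The helper names `clean`, `translate` and `gc_fraction` are illustrative, and only short excerpts of the gene sequence are used.

```python
# Standard codon table built in TCAG order; translation table 11 uses
# the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; a stop codon is reported as '*'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# Excerpts copied from the gene sequence: the first 30 nucleotides
# and the last 15 nucleotides, with the original block spacing.
first_blocks = "ATGGCCCGCA TGAAACTCTT CACCCGCCCC"
last_blocks = "GAGAAGTT CGCGTGA"

print(translate(first_blocks))  # MARMKLFTRP
print(translate(last_blocks))   # EKFA*
```

The two translated excerpts match the start (MARMKLFTRP) and the end (EKFA, followed by the stop codon) of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would give the value to compare against the GC content field.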