Gene Arth_3844 details


Gene Information

Locus tag: Arth_3844
Symbol:
ID: 4447596
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4325061
End bp: 4326662
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 66%
IMG OID: 639691668
Product: radical SAM domain-containing protein
Protein accession: YP_833319
Protein GI: 116672386
COG category: [R] General function prediction only
COG ID: [COG1964] Predicted Fe-S oxidoreductases
TIGRFAM ID:
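
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions are consistent with the values listed but are not stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4325061
end_bp = 4326662

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1602 533
```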


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCCGT CCTCGTTCTC TTCCCGGACG CCGCTTGGAC CTGGCCAGCC GCTGCGCGGC 
GACCGCATCC ACCGGTATGT CACGGCGTTC TGCCCCCGCT GCCACGAGAC GAATCCCCCG
CTTGCCCAGG TGCGGCGCCT GTCCGGGGCA CTGCTGATCC GTGACGACCG CGTGTGGCTC
GAACGCGGCT GCCCCGACCA CGGACTGGTG CGCACGCTGT ACGACGAATC GCCCGAAATC
CTGCGCTACC TGGAGAAATG GCAGGCGCCC ACCAAGCAGC ACATCCCCGA CCAGGCGGAC
AACTTCCGCC CGGTACCCGA GGCGTACGCC TACGGGCTTC CCGCCATGCA GACGCAGCAC
ACGTGCATCC TTCTGCAGGA CATTATCGAG CACTGCAATC TACGCTGCCC CACTTGCTTC
ACCTCCTCCG GCCCGCAGCT GCAAGGCGTG GCGCCGCTCG CCGAGGTGCT CGCGAACGTC
GATGCACGGC TTGCCCGCGA GAACGGGCGC CTTGATGTGC TGATGCTCTC CGGCGGCGAG
CCCACGCTCT ACCCTCATCT CGCCGAACTA CTGGAGGAAC TCGTCGCACG GCCGATCGTC
CGGATCATGG TGAACAGCAA CGGCATGCTG ATGGCGACCG ACGACGAGCT GCTCGCCCTC
CTGGCGAAGC ATCGAGACCG GGTGGAGGTG TACCTCCAAT ACGACGGACC CTCCAAGGAG
GCATCGATTC ACCACCGCGG AGGCGATCTG ACGCGCTTCA AGGATGCGGC GATCTCGCGC
CTGTCCGAGG CGGGCGTCTT CACGACCCTG ACGATGACAG CAACCCTCGG CGTCAACGAC
GGCGAAGTGG GCGCCGTCGT CATGCGCGCC CTGGAAACCC CGTTCGTGGG CGGAGTCGCA
CTCCAGCCGG TGTTCGGTTC CGGCCGCGGA CACGGAATCG ATCCCATGGA CCGGCTCACC
CACACGGGCG TCCTCGAAAG GCTGGCGGAG CAAACCAACG GGGTTGTCTC GTGGCACGAC
CTGACAGCCC TCCCTTGCTC GCATCCGCAC TGCGCCTCCG TCGGCTACAT GCTGAAGGAC
GACTCCGGCG TCTGGCGGTC GCTGACCGCC CTCATCGGGC ACGACCAGCT ACTCGCCTGG
CTGGAGCTCA ATCCTGACAG CCTCGCCAAC CGCATCGCAG ACAGAGCAAT CCCATTGGAG
CTGCGCAGCC TGATGAAGTC CTCGCTGCTC GATCTGCTGA GCGAACAGTC GTCGCTTTCC
CACCCGCGGA CAGCGGATCT CTGGAAGAAC ATCTGCACTC AGTGCGACCT CGGGATAGGC
ACGCTCACCA CGCTGGCGGC CGGCAAGCTC CCCGGTCAGC AGCAACGGCT GCGCAGGCTC
CTGGCCGAAC GCGTCACACG GATCATGGTC AAGCCATTCA TGGACATTTC GACCATGATC
GAGGAGAGGC TCACCCAATG CTGCGTGCAC GTCGGAACCA AGAGTGACGC AGGCGATCAC
CAGTGCGCCC CCTTCTGCGC CGTCCAGGCG TGGCCTGCCC TGTCCAGACA ACGGATGAGC
ACGGCCACGG GCGTGCAGCT GCTCCCGGTG CGCCAACTCT GA
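
The gene sequence is printed in space-separated groups of 10 bases, so it needs to be cleaned before any computation. A minimal sketch of one way to do that and to compute a GC fraction comparable to the "GC content" field; the helper names clean_sequence and gc_fraction are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(block: str) -> str:
    """Strip spaces and line breaks from a grouped sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence as printed in the record (six groups of 10 bases).
first_line = "ATGAATCCGT CCTCGTTCTC TTCCCGGACG CCGCTTGGAC CTGGCCAGCC GCTGCGCGGC"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the full gene sequence block to the same two helpers gives a value that can be compared with the GC content reported in the record.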
 
Protein sequence
MNPSSFSSRT PLGPGQPLRG DRIHRYVTAF CPRCHETNPP LAQVRRLSGA LLIRDDRVWL 
ERGCPDHGLV RTLYDESPEI LRYLEKWQAP TKQHIPDQAD NFRPVPEAYA YGLPAMQTQH
TCILLQDIIE HCNLRCPTCF TSSGPQLQGV APLAEVLANV DARLARENGR LDVLMLSGGE
PTLYPHLAEL LEELVARPIV RIMVNSNGML MATDDELLAL LAKHRDRVEV YLQYDGPSKE
ASIHHRGGDL TRFKDAAISR LSEAGVFTTL TMTATLGVND GEVGAVVMRA LETPFVGGVA
LQPVFGSGRG HGIDPMDRLT HTGVLERLAE QTNGVVSWHD LTALPCSHPH CASVGYMLKD
DSGVWRSLTA LIGHDQLLAW LELNPDSLAN RIADRAIPLE LRSLMKSSLL DLLSEQSSLS
HPRTADLWKN ICTQCDLGIG TLTTLAAGKL PGQQQRLRRL LAERVTRIMV KPFMDISTMI
EERLTQCCVH VGTKSDAGDH QCAPFCAVQA WPALSRQRMS TATGVQLLPV RQL
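
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one possible approach, not the database's own method: it builds a codon table in pure Python and translates the first 30 bases of the gene. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two differ only in the permitted start codons); the function name translate is hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are those of the standard
# genetic code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence from the record.
print(translate("ATGAATCCGT CCTCGTTCTC TTCCCGGACG"))  # MNPSSFSSRT
```

The output matches the first 10 residues of the protein sequence listed above.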