Gene Achl_4478 details


Gene Information

Locus tag: Achl_4478
Symbol:
ID: 7280046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011879
Strand:
Start bp: 413854
End bp: 415038
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 64%
IMG OID: 643580432
Product: protein of unknown function UPF0027
Protein accession: YP_002478246
Protein GI: 219883082
COG category: [S] Function unknown
COG ID: [COG1690] Uncharacterized conserved protein
TIGRFAM ID:
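
The length fields in this record are related by simple arithmetic. Assuming the start and end coordinates are 1-based and inclusive, the gene length is End - Start + 1, and for an unspliced CDS the protein length is the codon count minus the stop codon. A minimal Python sketch of that check follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(413854, 415038)
print(length)                  # 1185
print(protein_length(length))  # 394
```

Both values agree with the Gene Length and Protein Length fields in the record.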


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 75
Fosmid unclonability p-value: 0.00757416
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTCCCCG TCGAAATGCG CGGCACAGCC CGTCCCGTTC ACCTCTGGGC GCACGAGCAC 
GAGGTTGAGC CCGCTGCCCT GCAGCAGCTG CGCAACATCG CCTCCATGGA ATGGGTTCAT
GGCGTCCGTG TAATGCCCGA TGTCCATTTG GGCAAGGGGG CCACGGTCGG CTCGGTCATC
GCCATGAAGC AGGCAGTATC GCCGTCCGCC GTCGGCGTGG ACATCGGCTG CGGCGTCTCC
GCCGTCAAGA CATCACTGAC CGAGAACGAC CTGGACAACC TGCACGCGCT CCGGCTCGCC
ATCGAATCCG CCATCCCGGT CGGCTTCAAC TCCCACAGCC GTGACGTGAA CCTGAAGCGC
CTCGGCCTCG AGCGCGGTGC CAAGACGTTC TGGGACGGGT TCAAGGACCT CCACCCTGCA
GTGCAGAAGC TGGAATCCCG TGCCCACTCC CAGCTCGGAA CCTTGGGCGG CGGAAACCAC
TTCATCGAAG TCTGCGTCGA CGAGGCAGGC GCCGTCTGGC TGACCCTCCA CTCAGGCTCC
CGGAACGTCG GCAAGTCCCT CGCTGAGGTG CACATCGACA TCGCCAAGGG CCTGAGCCAC
AACAACGGCA TCGTCGATAA GGACCTGGCC GTGTTCCTGG CCGGCACACC CGAAATGGAC
GCCTACCGCC GTGACCTGTG GTGGGCTCAG GACTACGCCG CCCGGTCCCG CTCGGTGATG
ATGGGCCTGT TCAAGGAGCA GGTCGCCAAG CACTTCGCGA CGGCGAACGT CACGTTCGGC
GAGGAGATCA ACGTCCACCA CAACTATGTC TCCGAGGAGA TCATCGACGG CGAACTCATG
CTGGTCACCC GCAAGGGCGC CATCCGGGCT GGCAAGGGAA ACCTGGCATT GATCCCCGGC
AGTATGGGCA CGGGCAGCTA CGTTATTCGC GGCCGCGGGA ACGACGCATC CTTCCAGTCC
GCTTCCCACG GGGCTGGGCG GAAGATGAGC CGAAATGCGG CCAAGAAGGT GTTCACGGTC
GATGACCTGA TTGCCCAGAC CGCCGGAGTC GAGTCCCGCA AGGACCAGGC CATCGTCGAC
GAGATCCCCG GTGCGTACAA GGACCTGCAC AGCGTCATCG ACGCCCAGAA GGACCTGGTA
GACGTCGTCC AGCACCTGCG GACTGTCCTC TGCGTGAAAG GCTGA
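
The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do it, assuming GC content means the percentage of G and C bases over the full sequence; how the database rounds to its reported value is not stated here. The function name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First block of ten bases from the gene sequence above.
print(gc_content("ATGTTCCCCG"))  # 60.0
```

Because whitespace is stripped first, the sequence can be passed in the ten-base blocks shown in the record.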
 
Protein sequence
MFPVEMRGTA RPVHLWAHEH EVEPAALQQL RNIASMEWVH GVRVMPDVHL GKGATVGSVI 
AMKQAVSPSA VGVDIGCGVS AVKTSLTEND LDNLHALRLA IESAIPVGFN SHSRDVNLKR
LGLERGAKTF WDGFKDLHPA VQKLESRAHS QLGTLGGGNH FIEVCVDEAG AVWLTLHSGS
RNVGKSLAEV HIDIAKGLSH NNGIVDKDLA VFLAGTPEMD AYRRDLWWAQ DYAARSRSVM
MGLFKEQVAK HFATANVTFG EEINVHHNYV SEEIIDGELM LVTRKGAIRA GKGNLALIPG
SMGTGSYVIR GRGNDASFQS ASHGAGRKMS RNAAKKVFTV DDLIAQTAGV ESRKDQAIVD
EIPGAYKDLH SVIDAQKDLV DVVQHLRTVL CVKG
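
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal Python sketch of that translation is given below. It assumes the codon-to-amino-acid assignments of table 11 match the standard code (the two differ in permitted start codons, which this sketch does not model), and `translate` is an illustrative name rather than a library function.

```python
from itertools import product

# Codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; codons with ambiguous bases raise KeyError.
    """
    bases = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTTCCCCG TCGAAATGCG CGGCACAGCC"))  # MFPVEMRGTA
```

The output matches the first ten residues of the protein sequence in the record.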