Gene Achl_2378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_2378 
Symbol 
ID7293851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp2668872 
End bp2670203 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content66% 
IMG OID643590785 
Productprotein of unknown function DUF21 
Protein accessionYP_002488432 
Protein GI220913123 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.000000393714 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAGTGGC TACTCCTGGC GGCAGGGCTC CTGCTGATCG CCGGCACAGG CTTTTTCGTC 
GCCGTCGAAT TCTCCCTGAT TGCCCTCGAC CAGCCCACGG TCCAGCGGGC TGTCGACGCC
GGTGACGCCG GCGCCGTTCC CCTCCTTACC TGCCTCAAAT CGCTCTCCAC ACAGCTCTCC
AGCTGCCAGT TGGGCATCAC CCTCACCACC CTGCTCACCG GCTACGTCAT GGAACCATCG
GTGGGCCGCC TCCTGGAGGG GCCGCTGACG GCCCTTGGCC TGCCGGAGGT AGCTGCAGCA
TCAATTTCGC TGATCCTCGC CATGGTGCTG GCAACCCTGT TGTCGATGCT CCTTGGCGAA
CTCGTTCCCA AGAACATGGC CATCGCGTTG TCCTTCCCCG TCGGCAAAGC CCTGGCCAGG
CCGCAACTGA TCTTCACCGC GGTCTTCAGG CCGGCCATCG TGGTCCTCAA CGGCTTTTCC
AACCGGGTGC TCCACATCTT CGGGCTCGAA GCCAAGGAAG AGCTCTCCGG CGCGCGCACG
CCGTCCGAGC TGGCGTCACT GGTGCGCCGC TCGGCTGCGA TGGGAACGCT CGACGCCGGT
ACGGCCAACT TCGTGTCCCG CACCTTGAAT TTTTCCTCCA GGACCGCGGC CGACGTCATG
ACGCCGCGCA TCCGGGTGGA AATGATCGAC GCGGACCAGC CGGTCTCCGA CATCGTTGAC
GCGGCGCGCC GTACGGGATA CTCACGGTTC CCCGTGATCG GCGACTCTGC GGACGACATC
AAAGGCCTGG TCCACGTCAA GAAGGCCGTG GCCGTGCCCT CGGACAGGCG GCACAAGCTG
GAAGCCGGTG CCATCATGAC CGAAGTCCTC AGGGTTCCCG AGACCATCCA CCTTGACGCC
CTGCTGGCGG AACTCCGCGA AGGCAACCTC CAGCTGGCGG TGGTCCTCGA CGAATACGGC
GGCACCGCCG GCATTGCCAC GCTCGAAGAC CTGGTCGAGG AAATTGTGGG CGAGGTAGCC
GACGAACACG ACAAGGTGCG CCCGGGGCTG CTGCAGAGCG CCTCCGGGGA CTGGTATTTC
CCGGGACTCC TTCGCCCGGA CGAGTTGTCC GAGCAGATCC CGGGCCTGAC CGTCCCGGAC
GAAGCAGCCT ACGAAACCGT GGGAGGCTAC GTGATGAGCA AACTGGGCAG GATCGCGGCG
GTAGGGGACA CGGTGGCCGT GGACGGCGGC ACGCTGAGCG TTACCCGGAT GGACGGGCGC
CGCATCGACC GTATCTGCTT CCGGCCGGCT GCCCCTGAGC CGGACGGCAA CAACGACGGG
AGCCCATCAT GA
 
Protein sequence
MEWLLLAAGL LLIAGTGFFV AVEFSLIALD QPTVQRAVDA GDAGAVPLLT CLKSLSTQLS 
SCQLGITLTT LLTGYVMEPS VGRLLEGPLT ALGLPEVAAA SISLILAMVL ATLLSMLLGE
LVPKNMAIAL SFPVGKALAR PQLIFTAVFR PAIVVLNGFS NRVLHIFGLE AKEELSGART
PSELASLVRR SAAMGTLDAG TANFVSRTLN FSSRTAADVM TPRIRVEMID ADQPVSDIVD
AARRTGYSRF PVIGDSADDI KGLVHVKKAV AVPSDRRHKL EAGAIMTEVL RVPETIHLDA
LLAELREGNL QLAVVLDEYG GTAGIATLED LVEEIVGEVA DEHDKVRPGL LQSASGDWYF
PGLLRPDELS EQIPGLTVPD EAAYETVGGY VMSKLGRIAA VGDTVAVDGG TLSVTRMDGR
RIDRICFRPA APEPDGNNDG SPS