Gene Achl_2377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_2377 
Symbol 
ID7293850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp2667805 
End bp2668875 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content64% 
IMG OID643590784 
Productprotein of unknown function DUF21 
Protein accessionYP_002488431 
Protein GI220913122 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.000000976861 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGACT GGGCCGGCAT CGTCTGGCTG GCGTTCCTGC TGCTCGGCAA CGCGTTCTTC 
GTGGCGGCCG AGTTCGCCAT CATGTCCGCC CGGCGCAGCC AGATCGAACC CTTGGCGGAG
GCCGGGTCGA AGCGGGCCCA AACCACCCTG AAGGCGATGG AGAACGTCTC GCTGATGCTG
GCCTGCGCGC AGCTTGGCAT CACGGTGTGC TCGCTGCTGA TCCTGCTGGT GGCCGAGCCT
GCGATCCACC ACCTCCTGGC CGCGCCGCTG GAGCTGGTGG GCCTGCCCGT GGAAGTGGCC
GATGTTGCCG CGTTCGCTGT GGCCCTGATG TTCGTCACGT TCCTGCACGT CACCTTTGGC
GAGATGGTGC CCAAGAACAT CTCGGTATCG GTCGCTGACA AGGCGGCGAT GTTCCTGGCC
CCGCCGCTGG TCTTCGTGGC ACGGCTGGTC CACCCCGTCA TCTCGGTGCT GAACTGGTCG
GCGAACCATA TCCTCAAGCT GTTGCGGATC GAGCCCAAGG ACGAGGTCAA CTCGTCCTTC
ACGCTCGAGG AGGTCCAGTC CATCGTGCAG GAATCCACCC GGCACGGACT GGTGGACGAC
GACGCCGGCC TGATCACCGG CGCCCTTGAG TTCTCCGAAT ACACGGCTGG AGACATCATG
GTTCCGCTGG ACAGCCTGGT CATGCTCAAG GCTGCGACTA CTCCGGTGGA GTTTGAAAAG
GCTGTCAGCC GCACGGGTTT TTCCCGGTTC CCCATGCTGG ATGAGGACGA TCTCCTGTAT
GGCTACCTGC ACGTCAAGGA TGTGCTGTCC ATCCCTCCGA CGGCGTACGA GCTGCCCATT
GCGGAAAGCC GCGTCCGTTC CCTGGCCAAC CTGGCCCTGG GCGATGAAAT CGAAAAGGCC
ATGTCCGTCA TGCAGCGGAC CGGCTCGCAC CTTGCCCGCG TCATCGGCAA GGACGGCAAT
ACCCAGGGCA TCCTGTTCCT CGAGGATGTC ATTGAACAAC TCGTCGGCGA GATCCGGGAC
GCTACCCAGG CCACCGGCAT CCGACGGCTG GGGCAACCCA ACGGGGGATA G
 
Protein sequence
MSDWAGIVWL AFLLLGNAFF VAAEFAIMSA RRSQIEPLAE AGSKRAQTTL KAMENVSLML 
ACAQLGITVC SLLILLVAEP AIHHLLAAPL ELVGLPVEVA DVAAFAVALM FVTFLHVTFG
EMVPKNISVS VADKAAMFLA PPLVFVARLV HPVISVLNWS ANHILKLLRI EPKDEVNSSF
TLEEVQSIVQ ESTRHGLVDD DAGLITGALE FSEYTAGDIM VPLDSLVMLK AATTPVEFEK
AVSRTGFSRF PMLDEDDLLY GYLHVKDVLS IPPTAYELPI AESRVRSLAN LALGDEIEKA
MSVMQRTGSH LARVIGKDGN TQGILFLEDV IEQLVGEIRD ATQATGIRRL GQPNGG